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DESCRIPTION* 

COIVIPOSITIONS AND 1METHODS RELATE!* TO FLAVIVIRUS ENVELOPE 
PROTEIN DOMAIN III ANTIGENS 

BACKGROUND OF THE INVENTION 

This application claims priority to U.S. Provisional Patent Applications serial numbers 
60/403,89 3 filed on August 16, 2002 and 60/445,581 ifiled February 6, 2003 , each of which is 
incorporated in its entirety herein by reference. 

Tfcie government may own rights in the present invention pursuant to contract number 
U90/CCO61 8754-01 from U.S. Department of Health and Human Services Centers for Disease 
Control. 

1 _ Field of the Invention 

TGhe present invention relates generally to tfcie fields of virology, immunology and 
diagnostics. More particularly, it concerns antibodies directed to and antigens derived from 
flavivirus envelope protein domain 1TJ in compositioris and methods for detection of various 
members of the genus flavivirus. 

2, . Description of Related Art 

West Nile virus (WN) is a member of the Japanese encephalitis (JE) serocomplex of the 
genus Flavivirus (Family Flaviviridae). This virus wa.s first isolated from a febrile woman in the 
West Nile province of Uganda in 1937, and now has an almost worldwide distribution including 
parts of Africa, Asia, Europe and, most recently, NortLr America. Kunjin virus, now re-classified 
as a sub-type of West Nile virus, is found in Australasia. 

Since 1999, the United States has experienced annual epidemics of WN disease in 
humans and animals over an expanding geographical range. WN virus has- been isolated in 44 
states, and more than 4,100 cases of human disease resulting in 284 deathts had been reported 
during 2002 (MMWR , 2002a). Several of these cases are suspected to bxave originated from 
virus transmitted during blood transfusion and/or organ transplantatiorjL (MMWR, 2002b). 
Outbreaks of WN disease with neurological manifestations have also been reported in Eastern 
Europe^ North Africa and Israel since the mid-1990s Cjeviewed by Murgue et al, 2002). 
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Other members of the JE serocomplex include JE virus, found throughout Asia, St. Xouis 
encephahtis (SUB) virus, found in the Ajmericas, and Murray Valley encephalitis (MVE) virus, 
found in Australia, and New Guinea. Tfciese viruses are antigenioally similar to WN virus, and 
their co-circulation in several regions of the world has complicated the specific diagnosis of 
infections by these viruses in humans and other hosts (Fonseca et al, 1991; Martin et al, 2002). 
Current protocols for the serological diagnosis of WN virus infection in the United States rely 
primarily on preliminary screening for WN virus-reactive IgM/IgG antibody by capture ELISA 
and confirmation by plaque reduction neutralization test (PRNT) (CDC, 2001), a process which 
results in considerable delays in the reliable reporting of accurate case numbers, and requires the 
confinnatory testing to be performed in specialized laboratories. 

Current diagnostic assays utilize either ELISA or dipstick formats for identification of 
flavivirus infection (PanBio, Integrated Diagnostics (Dobler et al, 1996, Niedrig et al.^ 2001, 
Yoshii et al, 2O03)). A number of assays are available for the detection of dengue virus 
infection. These assays utilize antigen, capture and antibody-based ELISAs and dipsticks for 
detection of virus specific IgG or IgM. Diagnosis of TBE infection depends on IgG-based 
ELISA assays that are available in Europe (Dobler et al, 1996, Niedrig et al, 2001, Yoshii et al, 
2003). However, these tests have limitations with both sensitivity and cross-reactivity with other 
flaviviruses (Niedrig et al, 2001). 

The recent utilization of subviral particles (SVP) in an ELIS A-based diagnostic test for 
tick borne encephalitis TBE infection shows promise (Yoshii et al, 2003). Since this ass-ay uses 
intact viral M and E proteins it is likely that the pitfalls that affect the use of complete viral 
antigen (e.g., cross-reactivity) may impede the employment of this assay in diagnostic settings. 

The use of RT-PCR is also a potential method for diagnosis of flavivirus irxfection. 
However, RT-PCR assays have the significant hmitation of requiring advanced tectmiques, 
equipment and areagents that require a cold-chain for stability. In addition, RT-PCR detects the 
presence of virus in patient serum, a condition that is not usually met when patients come to a 
hospital as the virus is frequently cleared from the bloodstream by the onset of symptoms. 
Clearly, there is a need to improve the current reagents used for diagnosis of West Nile and TBE 
virus infections - 
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SUMMARY OF THE INVENTION 

Embodiments of the invention include the use of recombinant envelope protein domain 
HI (rDK or rD3) derived from West Nile virus (WN), tick: borne encephalitis serocomplex 
viruses (TBE), and/or other flaviviruses as a reagent(s) to detect the presence of anti-WN or anti- 
TBE antibodies in a subject, e.g., naturally infected primates, including humans. Certian 
embodiments include polypeptides derived from WN rDIH ttiat are sensitive and very specific 
for WN virus infection and can also differentiate between closely related mosquito-borne 
flaviviruses. Some embodiments of the invention include the use of poly-peptides deriv-ed form 
TBE rDIII (rD3) as a diagnostic antigen to the TBE serocomplex of flaviviruses. While 
differentiation between the very similar TBE viruses could not be achieved, some of the 
polypeptide reagents were highly specific for the tick-bome flaviviruses and were rau_ch more 
specific than mouse brain-derived viral antigen in differentiating flavivirus positive seaca in the 
ELISA format. 

The development of a specific and sensitive diagnostic assay for detection of flavivirus 
infection will greatly enhance the ability to identify, track, and treat diseases caused by these 
viruses. The present invention takes advantage of the observation that a flavivirus envelope 
protein domain III (Dili) antigen can be used to specifically detect serocomplexes of jflavivirus 
and antibodies against certain serocomplexes or certain flaviviruses, e.g., West Nile -'virus. In 
addition, trie present invention takes advantage of the observation that certain West Nile virus 
envelope protein domain ITI (WN-DIII) antigens can be used to specifically detect West Nile 
virus and antibodies against West Nile virus. Various embodiments of the invention are directed 
to compositions and methods related, to detecting West Nile virus or TBE serocomplex -viruses or 
antibodies in a subject, patient, animal, biological or other type of sample. 

The present invention includes compositions and methods for the detection or diagnosis 
of flaviviros, TBE viruses or West Nile virus. Recombinant West Nile virus envelope protein 
domain m (WN-rDIII) or a recombinant TBE serocomplex virus envelope protein domain III 
(TBE-rDITJ) can be expressed in E. coli as a fusion protein to produce a soluble protein that can 
be purified. Rabbit antisera raised against WN-rDHI or TEtE-rDIII shows virus or serrocomplex 
specificity, respectively, in physical and biological assays. Removal of a non-viral fusion 
component typically improves the specificity and signal intensity for WN-rDIII or TBE-rDIIL 

In certain embodiments of* the invention, methods for screening for a flavivirus in a 
subject include a) contacting a sample from the subject: with a composition comprising a 
flavivirus envelope protein domain ITI polypeptide under conditions that permit fozrmation of 
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specific inraiunocomplex between any antibody in the sample and the envelope protein domain 
m polypeptide; and b) detecting whether a specific imrnunocomplex is formed. An envelope 
protein domain III polypeptide refers to a polypeptide including the amino acids that define 
domain in, a structural element of the flavivirus envelope protein, for example amino acid 
sequences 292 to 402 of SEQ ID NO:3, amino acid sequences set forth in SEQ 3D NO:4-21 or 
homologous sequences from other flaviviruses. Homologous envelope protein domain III 
sequences from other flavivirus typically have an identity of at least 70, 75, SO, 85, 90, 95 
percent or greater to the amino acid sequence 292-402 set forth in SEQ ID NO: 3 or the amino 
acid seqeunces set forth in SEQ ID NO:4-21. Additionally, a specific immunocomplex refers to 
a complex between a polypeptide containing an epitope recognized by an arrtibody and the 
antibody that recognizes the epitope where the complex can be detected and distinguish above 
any non-specific or background interactions. The enveLope protein domain HI polypeptide may 
be a dengue virus envelope protein domain III polypeptide, yellow fever virus envelope protein 
domain m polypeptide, West Nile virus envelope protein domain III polypeptide, St. Louis 
encephalitis virus envelope protein domain III polypeptide, Murray valley en_cephalitis virus 
envelope protein domain m polypeptide, a Central European encephalitis (CEE> virus envelope 
protein domain III polypeptide, a Russian spring-sumrjaer encephalitis (RSSE) virus envelope 
protein domain m polypeptide, a Langat (LGT) virus envelope protein domain III polypeptide, a 
Powassan virus (POW) envelope protein domain III polypeptide, an Alkhurma CALK) envelope 
protein domain IE polypeptide, a Kyasanur Forest disease (KFD) virus envelope protein domain 
in polypeptide, an Omsk hemorrhagic fever (OEDF) virus envelope protein domain III 
polypeptide or a combination or variant thereof. In particular embodiments, the envelope protein 
domain III polypeptide is a West Nile virus enevelo>pe protein domain HI polypeptide or a 
variant thereof. In other embodiments, the envelope protein domain III polypeptide is derived 
from a CEE or a RSSE enevelope protein domain IO polypeptide or a variant thereof. The 
envelope protein domain III polypeptide may include 5 , 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 
18, 19 , 20, 25, 30, 35, 40, 45, 50, 55, 60. 65. 70, 75, 80^ 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 
100, 101, 102, 103, 104, 105, 106, 107, 108, 109, or 1 10 contiguous amino acids of a flavivirus 
envelope protein domain III polypeptide or a variant thereof. It is contemplated that 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more carboxy and/oi amino terminal amino acids flanking the 
envelope protein domain III may also be included in an envelope protein domain HI polypeptide. 
In certain embodiments, an amino acid sequence that is about or at least 50%, 55%, 65%, 70% 
75%, 80%, 85%, 90%, 95%, 99%, or any value therebewteen, identical to amino aicd 292-402 of 
SEQ ID NO:3 and/or SEQ 3D NO:8-21 is contemplated. A domain EI polypeptide may include 
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the amino acids 292-402 as set forth in SEQ ID NO:3, the amino acids 1-111 as set forth in SEQ 
ID NO:21, the amino acids as set forth in SEQ ID NO:4-20, or variants thereof. Some 
embodiments of the invention further comprise at least a second envelope protein domain HI 
polypeptide. A second envelope protein domain HI polypeptide may be selected from SEQ ID 
NO:3-21 or a similar sequence from other flaviviruses or closely related viruses. The envelope 
protein domain HI polypeptide may be prepared by isolating a recombinant or non-recombinant 
envelope protein domain III polypeptide. The envelope protein domain HI polypeptide may be 
denatured or non-denatured. In particular embodiments the envelope protein domain III 
polypeptide is prepared by isolating a recombinant envelope protein domain. Ill polypeptide 
fusion protein. In certain embodiments, a recombinant envelope protein domain EI polypeptide 
may be cleaved by an appropriate protease to separate the envelope protein domain III 
polypeptide from its viral or non-viral fusion partner <e.g., GST, his-tag or MBP). A envelope 
protein, domain EI polypeptide may be obtained from bacteria comprising an expression vector 
encoding the envelope protein domain EI polypeptide or envelope protein domain EI 
polypeptide fusion protein. The envelope protein donaain EI polypeptide or fixsion protein may 
be obtained from a mammalian or insect cell composing an expression vector encoding the 
envelope protein domain IE polypeptide or fusion protein. 

hi certain embodiments it is contemplated an. envelope protein domain IE polypeptide 
may be used in conjunction with 1, 2, 3, 4, 5, 6, or more additional antigens derived the same or 
other members of the flavivirus genus family. These polypeptides may be used in a variety of 
formats including, but not limited to ELISA and peptide array formats. 

In various embodiments, samples may be derived from a variety of subj ects infected with 
or suspected to be infected -with a flavivirus, including WN or a TBE serocornplex virus. The 
subjects include, but are not limited to an animal, a bird, a human, a mosquito, a tick or other 
host organism for a flavivinxs. 

The step of determining whether an immunoeomplex is formed may be accomplished by 
a number of ways well known to those of ordinary skill in the art. The immunocomplex may be 
detected by ELISA, Western blotting, dipstick or peptide array. In other embodiments, an 
immunocomplex is detected using anti-antibody secondary reagents. Anti-antibody secondary 
reagents refer to agents that specifically bind or detect an antibody. Compounds of the invention 
may ~be labeled with a detecting agent, which may be colorimetric, enzymatic, radioactive, 
chromatographic or fluorescent. The antigen may "be affixed to a solid non-reactive support, 
which refers to a compound that wiU not react with antigens of the invention or antibodies in any 
sample. The support may be a plate or assay dish, and be made of any non-reactive material, 
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including, glass, plastic, silicon or the like. An antibody may include, but is not limited to an 
IgA, an IgG or an IgM antibody. 

Various embodiments include methods of identifying a flavivirus in a subject comprising 
a) contacting a sample from the subject with a composition comprising a,t least one flavivirus 
envelope protein domain IB polypeptide under conditions that permit formation of specific 
immimocomplex between any antibody in the sample and the envelope protein domain m 
polypeptide; and b) detecting whether a specific irnrriunocomplex is formed . 

Certain embodiments of the invention include compositions for testing a sample for 
flavivirus or antibodies to flavivirus comprising an isolated flavivirus envelope protein domain 
m polypeptide. In particular embodiments, trie flavivirus envelope protein domain III 
polypeptide is a West Nile virus or a TBE sero complex virus envelope protein domain III 
polypeptide or variants thereof. A West Nile virus envelope protein domain III polypeptide may 
be derived from West Nile strains 382-99, EthAai4766, 385-99, Kunjhn MRM16, Golblum, 
TL44, DakAnMg, 804994 or a variant thereof, xvhich may be obtained through the World 
Arbovirus Reference Collection at the University- of Texas Medical Branch at Galveston or 
similar depositories such as the American Type Culture Collection. A TBE serocomplex virus 
may include a Central European encephalitis (CEE} virus, a Russian spring-summer encephalitis 
(RSSE) virus, a Langat (LGT) virus, a Powassan virus (POW), an Alkhurma (ALK), a Kyasanur 
Forest disease (BCFD) virus, or an Omsk hemorrhagic fever (OHF) virus, which may be obtained 
through the World Arbovirus Reference Collection at the University of Texas Medical Branch at 
Galveston or similar depositories such as the American Type Culture Collection. The 
composition may include a flavivirus envelope protein domain HI polypeptide, which may 
comprise 10, 11, 12, 13, 14, 15, 16, 17, 18 , 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 
85, 90, 95, 100, 110, or more, as well as values there between, of consecutive amino acids of the 
envelope protein domain in polypeptide or variants thereof. In particular embodiments, the 
composition may comprise the amino acid sequence as set forth in, or is about or at least 50%, 
55%, 65%, 70%, 75%, 80«3/o, 85%, 90%, 95%, 99<M>, or any value therebewteen, identical to, one 
or more of SEQ ID NO:3-21. The envelope protein domain HI polypeptide may be operatively 
linked to a substrate such as a plate, a microtiter plate, a bead, or a microarray. 

Embodiments of ttie invention also include compositions for testing a sample for West 
Nile virus or a TBE serocomplex virus comprising an isolated flavivirus or flavivirus envelope 
protein domain HI polypeptide as described above and incorporated here by reference. 

Embodiments of trie invention also include kits comprising any of the components of the 
invention described above, in a suitable container means. Kits may include one or more 
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flavivirus, TBE serocomplex virus or West Nile virus envelope protein domain HI antigeais. In 
still further embodiments, antigens are from the same or different strains. Such antigens may be 
in the same or in. separate compositions. Kits may further include non-reactive supports in. which 
antigens of the invention are affixed or attached. Kits may also include secondary antibody 
reagents and/or other detection reagents. Antigens or antibodies in the kits may be labeled. 
Labels may be colorimetric, enzymatic, radioactive, or fluorescent. The envelope protein 
domain III polypeptide may be a dengue fever virus envelope* protein domain in polypeptide, 
yellow fever virus envelope protein domain III polypeptide, West Nile virus envelope protein 
domain III polypeptide, St. Louis encephalitis virus envelope protein domain III polypeptide, 
Murray Valley encephalitis virus envelope protein domain III polypeptide, a Central European 
encephalitis (CIEE) virus envelope protein domain HI polypeptide, a Russian spring-summer 
encephalitis (R.SSE) virus envelope protein domain III polypeptide, a Langat (LGT7) virus 
envelope protein domain III polypeptide, a Powassan virus (POW) envelope protein domain EI 
polypeptide, an Alkhurma (ALK) envelope protein domain IO polypeptide, a Kyasanur Forest 
disease (KFD) virus envelope protein domain HI polypeptide, an Omsk hemorrhagic fever 
(OHF) virus envelope protein domain HI polypeptide or a combination thereof. In particular 
embodiments, the envelope protein domain III polypeptide is at West Nile virus envelopes protein 
domain III polypeptide. A kit may include compositions for screening for West Nile or TBE 
serocomplex virus antibodies in a subject comprising: a) an assay plate comprising a multiplicity 
of microtiter wells comprising a composition comprising at lea_st one envelope protein domain III 
polypeptide capable of binding a flavivirus antibody in the sample that can specifically b»ind to at 
least one envelope protein domain HI polypeptide; and b) a container means comprising a 
labeled secondary antibody having specific binding affinity for a flavivirus antibody in the 
sample that can specifically bind to at least one envelope protean domain III polypeptide. 

Embodiments of the invention also include methods of screening for flavivirus in a 
subject comprising: a) contacting a sample from the subject with a composition from the kit 
under binding conditions; and, b) detecting whether a specific immunocomplex is formed 
between an antibody and the at least one envelope protein domain III polypeptide. 

Various embodiments of the invention include va_ccine compositions comprising a 
flavivirus, TBE serocomplex or West Nile envelope protein domain HI polypeptide as described 
herein. The vaccine composition may further comprise an adjuvant(s) and an excipienO) known 
in the art. 

Other embodiments of the invention include an antibody or antibodies that selectively 
bind to an epitope in a envelope protein domain HI of a flavivirus, TBE serocomplex or West 
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Nile virus envelope protein. The epitope may be present in a West Nile or a TBE serocomplex 
envelope protein domain m polypeptide or a variant thereof. 

It is contemplated that any embodiment of a method or composition described herein can 
be implemented with respect to any other method or composition described herein. 

The use of the word "a" or "an" when used in conjunction with the term "comprising" in 
the claims and/or the specification may mean "one," but it is also consistent with the meaning of 
"one or more," "at least one," and "one or more than one." 

The use of the term "or" in the claims is used to mean "andL/or" unless explicitly indicated 
to refer to alternatives only or the alternative are mutually exclusive, although the disclosure 
supports a definition that refers to only alternatives and "and/or." 

Other objects, features and advantages of the present irrvention will become apparent 
from the following detailed description. It should be understood, however, that the detailed 
description and the specific examples, while indicating specific embodiments of the hrvention, 
are given by way of illustration only, since various changes and modifications within the spirit 
and scope of trie invention will become apparent to those skilled in the art from this detailed 
description. 

BRIEF DESCRIPTION QV THF, DP A-W¥NfiK 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

FIG. 1 illustrates an exemplary amino acid alignment of envelope protein doamain Ills 
from various flaviviruses. 

FIG. 2 illustrates a two-dimensional schematic of ttxe topology and structure of a 
flavivirus envelope protein. 

FIG. 3 illustrates the binding of rabbit antiserum raised against WN recombinant 
envelope protein domain in antigen to flavivirus envelope proteins in western blot assays with 
whole virus antigens of (1) WN, (2) TE, (3) SLE, and (4) MVE viruses. 

FIG. 4 illustrates Western blot analysis or WN envelope protein domain IH specific 
monoclonal antibodies 5H10, 3A3, 7H2, 5C5, 3D9, and a polyclonal antiserum to WNT envelope 
protein domain III. 

FIG. 5 illustrates the results of an exemplary PRNT assay showing the neutralization 
activity of rabbit anti-envelope protein domain m sera. 
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FIG. 6 illustrates an envelope protein domain KI amino acid sequence variations for ten 
West Nile virus strains, and representative JE (Genbank accession U21057), SLE (Oenbank 
accession Ml 66 14) and MVE (Genbank accession M24220) viruses. Dots (.) indicate 
conservation with the West Nile virus strain 385-99 sequence?. Residues associated with escape 
from neutralization by Mabs or anti-envelope protein domain III serum for WN virus strains are 
shaded. 

FIG. 7 illustrates the binding of selected anti-flavivirus mouse immune ascitic fluids in 
an indirect EUSA protocol utilizing whole-virus JE serocoxnplex antigens (WN, JE, SLE, or 
MVE viruses) or recombinant WN envelope protein domain EL Error bars 1 standard deviation 
from the mean. 

FIG. 8 illustrates the binding of selected anti-flavivrrus mouse immune ascitic fluids in 
an indirect EUSA protocol utilizing whole-virus JE serocomplex antigens (WN, JE = SLE, or 
MVE viruses) or recombinant WN envelope protein domadn IE cleaved from a GST fusion 
protein. 

FIG. 9A-9C illustrates the binding of selected anti-flavivirus mouse imimrne ascitic 
fluids in an indirect ELISA protocol utilizing WN rDIII cleaved from an maltose binding protein 
(MBP) fusion protein, MBP WN rDJU fusion protein at 35 rig/well, and MBP WN rDJH fusion 
protein at 17.5 ng/well. 

FIG. 10 Phylogentic analysis of the fiavivirus envelope protein domain III aomino acid 
sequence. Analysis was performed using maximum parsimony analysis. The tree was rooted 
using the non- vector borne Rio Bravo virus. 

FIG. 11 Western blot of recombinant DIE. Ten ng of purified recombinant Dili was run 
on 12% SDS-PAGE gels and transferred to nitrocellulose. Blots were probed with homologous 
or heterologous anti-DIII serum. Asibi, yellow fever type strain; 17D, yellow fever vaccine 
strain; WN, West Nile virus; KFD, Kyasanur Forrest disease virus; KUM, central European TBE 
strain Kumlinge; LGT, Langat; OHF, Omsk hemorrhagic disease virus; POW, Powassan virus. 

FIG. 12A-12F ELISAs using MIAF to detect virus derived antigen. Mouse brain virus- 
derived antigen was coated into 96 well plates at 1 HA unit per well and MIAF were tested in 
two-fold serial dilutions. Each value represents the mean, of duplicate wells. The; legend in 
panel B is for all six panels. The tick-borne flaviviruses are represented by open symt>ols. 

FIG. 13A-13F ELISAs -using virus derived antigen to detect IgG in rabbit anti-DIII 
specific antiserum. Antigens were coated in the plates a,s 1 HA unit per well an.d anti-DIII 
specific sera were tested in two-fold serial dilutions. Each value is the mean of duplicate wells. 
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The legend refers to rabbit anti-Dill specific sera and the legend in panel A is for all panels. 
Tick-borne flaviviruses are represented by open symbols. ISTote scale differences in the Y-axis. 

FIG. 14A-14H ELISAs "using rDIII to detect IgG in rabbit anti-DIII specific antiserum. 
Recombinant rDIII was coated into plates at 20 ng per well and Din specific sera were tested in 
two-fold serial dilutions. Each "value is the mean of duplicate wells. The legend, for all panels 
refers to Dili specific sera and is presented in panel H. Tick-borne flaviviruses are represented 
by open symbols. Note scale differences in Y-axis. 

PIG. 15A-15H ELISAs using rDm to detect virus specific IgG in MIAF- Recombinant 
Dili was coated into plates at 20 ng per well and MIAF were tested in two-fold serial dilutions. 
Each value represents the mean of duplicate wells. The legend for all panels refers to MIAF and 
is presented in panel A. Tick-borne flaviviruses are represented by open symbols. Note scale 
differences in the Y-axis. 

FIG. 16 illustrates an exemplary amino acid alignment of envelope protein domain Ills 
from various flaviviruses. 

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

Various embodiments of the invention include compositions and methods related to 
flavivinxs, TBE serocomplex flaviviruses (viruses) (TB-E) or West Nile virus (WN) envelope 
protein domain III (Dili or D3) or recombinant Dili (rDIII or rD3) as an antigen for specific 
diagnosis or detection of flavivirus, TBE serocomplex viruses and/or WN vims. The flavivirus 
envelope protein (E) is the major virion surface protein. It plays an important role in virus 
attachment and entry into host cells, and is also an important target for virus neutralizing 
antibodies (Sanchez and Ruiz, 1996; Mandl et at, 2000; Crill and Roehrig, 2001 ). The inventors 
describe the identification of residues associated with the neutralization of lineage I WN virus 
strain 385-99 (isolated in New York City in 1999) by monoclonal antibodies QVIAbs) that bound 
to DHL the putative receptor-binding domain, of the envelope protein. 

"Using these Din-reactive MAbs and a polyclona.1 serum generated against a recombinant, 
bacterially-expressed WN virus rDIII fragment, the antigenic relationships between WN virus 
strains representative of genetic lineages I and n haves been investigated and envelope protein 
domain IH residues that constitute subtype specific epitopes have been indentifled. 

The present invention includes compositions antd methods for the detection or diagnosis 
of a flavivirus, including compositions and methods for distinguishing Between different 
flaviviruses or groups of flavi-viruses. In particular embodiments, the flavivirus being detected is 
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the West Nile virus or a TBE serocomplex virus. Recombinant flavivirus, TBE virus or West 
Nile virus envelope protein, domain HI (rDHI) can be expressed in E. coli as a fusion protein to 
produce a soluble protein that can easily be purified. Rabbit antisera raised against a rDIII 
(rDHI) shows virus specificity in physical and biological assays. Removal of the fusion 
component improves specificity and signal intensity for a particular rDUI. 

The serological diagnosis of infection by flavriviruses can be complicated by the presence 
of flavivirus cross-reactive antibodies that produce false-positive results for rflavivirus infections, 
especially in regions where more than one virus is endemic. Current diagnostic reagents for tick- 
borne flavivirus infection liave been found to cross-react with yellow fever or dengue positive 
sera. In certain embodiments, recombinant fiavivinxs envelope protein domain III (rDIII or rD3) 
can be used as a diagnostic reagent to differentiate between infection by mosquito- and tick- 
borne flaviviruses. Embodiments of the invention also include the use of rDIII in an ELISA- 
based format for differentiation between serum specific for either mosquito- or tick-borne 
flaviviruses, which may or may not differentiate among the members of the tick-borne 
encephalitis (TBE) serocomplex of flaviviruses. Sera derived against several TBE serocomplex 
rDIII were found to cross-react with heterologous rDHI within the TBE serocomplex, but not 
with those from mosquito-borne flaviviruses, in both Western blots and ELISAs. Mouse 
hyperimmune serum generated against TBE serocomplex viruses was also found to react 
specifically with TBE serocomplex rDHI, but not -with rDIII from mosquito-borne viruses and 
vice versa. A similar test using virus-derived antigen was performed and a loss of both 
specificity and sensitivity was observed. These results indicate that flavivirus rDUI would be a 
useful reagent for the detection of infection by TBE serocomplex flaviviruses, several of which 
are potential biothreat agents, but may not provide the ability to differentiate between infections 
by separate members of trie serocomplex. 

I. FLAVIVIRUS 

West Nile virus and TBE viruses are members of the genus Flavivirus. The genus 
Flavivirus is a genera of the Flaviviridae family and includes the viral groups of Yellow Fever 
virus group, Tick-borne encephalitis virus group, Rio Bravo Group, Japanese encephalitis Group, 
Tyuleniy Group, Ntaya Group, Uganda S Group, Dengue Group, and Modoc Group. Members 
of the Flavivirus genus may produce a wide variety of disease states, such as fever, arthralgia, 
rash, hemorrhagic fever, and/or encephalitis. The outcome of infection is influenced by both the 
virus and host-specific factors, such as age, sex, genetic susceptibility, and/or pre-exposure to the 
same or a related agent. Some of the various diseases associated with members of the genus 
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Flavivirus are yellow fevex; dengue fever; and West Nile, Japanese, and St. Louis encephalitis. 
For a review of Flaviviruses see Burke and Monath (2001), which is incorporated herein by 
reference. 

Virions of the Flaviviridae generally contain one molecule of a linear positive-sense 
single stranded KNA genome of approximately 10,000-11,000 nucleotides that replicates in the 
cytoplasm of an infected cell. Typically the 5' end of the genome has a cap and the 3' end that 
may or may not have a poly (A) tract. Many menibers of the genus Flavivirus axe transmitted by 
a vector such as an insect, in many cases the insect is a mosquito. 

The viral genome of the Flavivirus genus is translated as a single polyprotein and is 
subsequently cleaved into mature proteins. The proteins encoded by the virus typically consist 
of structural and non-structural proteins. Generally, there are three structural proteins that 
typically include the envelope protein (E protein)(amino acids 275-787 of GenBank accession 
number NP_041724, incorporated herein by reference and SEQ ID NO:2), the core or capsid 
protein (C)(amino acids 1-92 of GenBank accession number NP 04-1724), and the pre- 
membrane protein (pxeM) (amino acids 105-223 of GenBank accession number 
NP_041724)(Yamshcliikov et al, 2001, incorporated herein by reference). The envelope protein 
is approximately 496 amino acids with an approximate molecular weight of 50 kDa and is often 
glycosylated. The envelope protein typically contains twelve conserved cysteine residues which 
form six disulfide bridges. The core protein is approximately 13 kDa and is rich in arginine and 
lysine residues. The pre-membrane protein is approximately 10 kDa and is cleaved during or 
after release of the virus from infected cells. A. cleavage product of the prM protein remains 
associated with the virion and is approximately S kDa and is termed the membrane protein (M). 
Typically, it is the carboxy terminus of prM that remains associated with the virus particle as the 
M protein. 

The flavivirus E protein is a dimer positioned parallel to virus surface. The ectodomain 
includes three domains I- Central domain (EI), II- Dimerization domain (EH), III- 
Immunogenic/Receptor binding domain (DUI) (FIG. 2). The amino acid sequence of an 
exemplary West Nile virus E protein Envelope protein domain III is set forth in SEQ ID NO:3. 
An amino acid alignment of various flavivirus Dills is presented in FIG. 1. The E protein 
envelope protein domain III is approximately 1 0.5 kDa with a single disulfide bridge. The E 
protein envelope protein domain III has an Ig-like fold, which is a p-banrel "type" configuration 
with no a-helices. Some flavivirus E protein domain ttTs contain a RGD integrin-binding motif. 
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Serological comparisons of West Nile virus strains have distinguished four major 
antigenic subtypes: a group of strains from Africa; strains from Europe and some Asian strains; 
strains from India; and strains of Kunjin virus from Australasia (Doherty et al, 1968; Hammam 
et al, 1966; Blackburn et al, 1987; Calishere* al, 1989; Morvan et al, 1990). Subsequently, 
analyses of nucleotide sequences identified two major genetic lineages 3 designated I and II, 
which included some subtypes and which correlated well with the antigenic groupings. Genetic 
lineage I included European and some African strains, Kunjin virus strains, and Indian strains; 
lineage II comprised only African strains (Lanctiotti et al, 1999; Jia et al., 1999; Scherret et al, 
2001). 

The TBE virus group that is associated, with human disease is distinct genetically and 
antigenically from the mosquito-bome viruses and are hence referred to as the TBE 
serocomplex. In addition to viruses that cause TBE, there are several o"ther viruses within this 
serocomplex. Among these are the Langat (LG-T) virus that is not known to infect humans in a 
natural environment, louping ill (LI) virus that causes encephaltitic disease normally in sheep, 
Powassan virus (VOW) that also causes encephalitis, and the hemorrhagic fever associated 
viruses Alkhurma (ALEC), Kyasanur Forest disease (KFD) and Omsk hemorrhagic fever (OHF) 
(Burke and Monath, 2001). Tick-borne encephalitis (TBE) is a disease endemic to vast areas 
from western Europe across Asia and into Japan and China. This disease is characterized by 
rapid onset of fever with subsequent development of potentially fatal encephalitis (Gritsun et al, 
2003). TBE found in Europe is typically less severe than that found in central and eastern Asia 
and the viruses that cause the different forms of the disease can be distinguished genetically and 
also by their tick vectors. Three subtypes of XBE have been described based on both serology 
and genetic data: central European encephalitis (CEE) (or western subtype), Siberian subtype 
TBE and Far-eastem subtype TBE (Heinz et al, 2000). The disease caused by the latter two 
subtypes are often commonly referred to as Hussian spring-summer encephalitis (RSSE). In 
addition, OHF, KFD and RSSE viruses are listed as potential biothrea-t agents by the National 
Institutes for Health and Centers for Disease Control. The possible introduction of these viruses 
by natural or artificial means into non-endemic areas, as well as the present extensive endemic 
regions, make the diagnosis of infection by these viruses a major publdc health objective. The 
lack of simple and accurate diagnostic assays makes the development of a TBE serocomplex 
diagnostic kit very important to rapid recognition of the causative agent of disease. 

Various members of the Flaviviridae family are available through the American Type 
Culture Collection (Manassas Va.) under the following ATCC numbers; Dengue type 1 (VR-71), 
Ilheus (VR-73), Japanese encephalitis (VR-74), Murray Valley encephalitis (VR-77), Ntaya 
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(VR-78), St. Louis encephalitis (VR-80), Uganda S (VR-81), West Nile (VR-82), Zika (VR-84), 
Dengue type 4 (VR-217), Dengue type 2 (VR-222), Japanese encephalitis (VR-343), Dengue 
type 1 (VR-344), Dengue type 2 (VR-345), Edge hill (VR-377), Entebbe bat (VR-378), 
Klokobera (VR-379), Stratford (VR-380), Tenibusu (VR-381), Dakar bat (VR-382), Maya (VR- 
78), Banzi (VR-414), Modoc (VR-415), Rio Bravo virus (VR-416), Cowbone ridge (VR-417), 
Bukalasa (VR-418), Montana myotis leukoencephalitis (VR-537), Bussuquara (VR-557), Sepik 
CVR-906), Cowbone ridge (VR-1253), Dengue type 2 (VR-1255), Dengue type 3 (VR-1256), 
Dengue type 4 (VR-1257), Ilheus (VR-1258), Rio Bravo virus (VR-1263>, St. Louis encephalitis 
CVR-1265), West Nile CVR-1267), Dengue type 4 (VR-1490), West Nile (VR-1507), and West 
3Mile (VR-1510), each of which is incorporated herein by reference. 

II. PROTEINACEOUS COMPOSITIONS 

hi various embodiments of the invention Flavivirus, TBE virus or West Nile virus 
polypeptides or proteins may be comprised in various proteinaceous- compositions. These 
proteinaceous composition may be used in the detection of flavivirus members, vaccination_ 
against flavivirus members, as well as other methods and compositions described herein. 

A. Proteinaceous Compositions 

hi certain embodiments, the present invention concerns novel compositions comprising at 
least one proteinaceous molecule, such as a riDffl polypeptide (antigen) alone or in combination* 
with other flavivirus envelope proteins, envelope protein domain Ills ot fragments thereof. As 
used herein, a "proteinaceous molecule," "proteinaceous composition," "proteinaceous 
compound," "proteinaceous chain" or "proteinaceous material" generally refers, but is not 
limited to, a protein of greater than about 200 amino acids or the full length endogenous 
sequence translated from a gene; a polypeptide of greater than about 1O0 amino acids; and/or a. 
peptide of from about 3 to about 100 amino acids. All the "proteinaceous" terms describeed 
above may be used interchangeably herein. The term "antigen" refers to any substance o>r 
material that is specifically recognized by axi antibody or T cell receptor. The term "epitope:" 
refers to a specific antigenic determinant that is recognized by an antibody or T cell receptor. 
Thus, it is contemplated that the antigens of the invention may be truncations or only portions of 
a full-length polypeptide. For example, a "rDIII antigen" refers to n peptide or polypeptide 
containing contiguous amino acids of envelope protein domain IK, including at least one 
envelope protein domain III epitope, but it may be fewer than a full-length amino acid sequence. 
Thus, an envelope protein domain m antigen may include a region o:f contiguous amino acids 
derived from any of SEQ ID NO:3-21. 
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SEQ ID NO:2 corresponds to protein accession number TMP_041724, which is the 
sequence for a West Nile virus. SEQ ID N~0:3 corresponds to amino acids 291-787 of SEQ ID 
NO:2 , which is a full-length processed IE protein envelope protein domain HI polypeptide 
sequence, Immunogenic regions of flaviv-irus envelope proteins have been described, and the 
present invention includes antigens that include one or more such regions. 

In certain embodiments, a proteinaceous molecule comprisin.g a TBE serocomplex »irus 
or a West Nile virus envelope protein domain III antigen may comprise, be at least, or be at :most 
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24-, 25, 26, 27, 28, 29, 30, 31, 
32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 5 0, 51, 52, 53, 54, 55, 56, 57, 
58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 
84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, S5, 96, 97, 98, 99, 100, lOl, 102, 103, 104, 105, 106, 
107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124^ 125, 
126, 127, 128, 129, 130, 140, 150, 160, 170, 180, 190, 200 or greater contiguous amino acid 
residues, and any range derivable therein of SEQ ID NO:2, or SEQ ED NO:3-21. 

As used herein, an "amino molecule" refers to any amino acid, amino acid derivative or 
amino acid mimic as would be known to ome of ordinary skill in the art. In certain embodiments, 
the residues of the proteinaceous molecule are sequential, without any non-amino molecule 
interrupting the sequence of amino molecule residues. In other embodiments, the sequence may 
comprise one or more non-amino molecule moieties. In particular embodiments, the sequence of 
residues of the proteinaceous molecule may be interrupted by one or more non-amino molecule 
moieties. 

Encompassed by certain embodiments of the present invention are peptides, such sis, for 
example, a peptide comprising all or part of a flavivirus envelope antigen (including at least one 
epitope) of any subtype or clade. Peptides of the invention may comprise, be at least, ox be at 
most 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, SO, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99,, 100, 101, 102, 103, 104, 105, 
106, 107, 108, 109, 110, 111 contiguous amino acids, including all or part of any of S-EQ ID 
NO:2-21. 

Accordingly, the term "proteinaceous composition" encompasses amino molecule 
sequences comprising at least one of the 20 common amino acids in naturally synthesized 
proteins, or at least one modified or unusual amino acid, including but not limited to those shown 
on Table 1 below. 
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TACBLE 1 
Modified and Unusual Amino Acids 


Abbr. 


Amino Acid 


Abbr. 


Amino Acid 


Aad 


2-Atninoadipic acid 


EtAsn 


N~Bthy"lasparagine 


Baad 


3- Aminoadipic acid 


Hyl 


Hydroxylysine 


Bala 


p-alanin_e, P-Amino-propionic acid 


AHyl 


allo-Hydroxylysine 




2-Aminobutyiic acid 


3Hyp 


3-Hydaroxyprolme 




4- Amiixobutync acid, piperidiiiic acid 


4Hyp 


4-Hydiroxyproline 




6-Aminocaproic acid 


Ide 


Isodesmosine 




2-Anun.oheptanoic acid 


Alle 


allo-Isoleucine 




2-Atnni.oisobutyric acid 


MeGly 


N-Methylglycine, 
sarcosine 


Baib 


3-Arrdn.oisobutyric acid 


Melle 


N-M^ethylisoleucine 


Apm 


2-Amirxopimelic acid 


MeLys 


6-N-]V3ethyllysine 


Dbu 


2,4-Diaminobutyric acid 


MeVal 


N-Methylvaline | 


Des 


Desmosine 


Nva 


Norvailine 


Dpm 


2,2'-Diaminopimelic acid 


Me 


Norlexicine 


Dpr 


2,3-Diaminopropionic acid 


Orn 


Ornithine 


EtGly 


N-Ethylglycine 







In certain embodiments the proteinaceous composition comprises at least one protein, 
polypeptide or peptide. In further embodiments the proteinaceous composition conxprises a 
biocompatible protein, polypeptide or peptide. As used herein, tibe term "biocompatible" refers 
to a substance which produces no significant untoward effects wtien applied to, or administered 
to, a given organism according to the methods and amounts described herein. Such untoward or 
undesirable effects are those such as significant toxicity or adverse immunological reactions. In 
preferred embodiments, biocompatible protein, polypeptide or peptide containing compositions 
will generally be viral proteins or peptides or synthetic proteins or peptides each essentially free 
from toxins, pathogens and harmful imrraunogens. 

Proteinaceous compositions may" be made by any technique known to those of skill in the 
art, including the expression of proteins, polypeptides or peptides through standard molecular 
biological techniques, the isolation of proteinaceous compounds from natural sources, or the 
chemical synthesis of proteinaceous materials. The nucleotide and protein, polypeptide and 
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peptide sequences for various genes have been previously disclosed, and may be fonnd at 
computerized databases known to those of ordinary skill in the art. One such databases is the 
National Center for Biotechnology Information's Genbank and GenPept databases 
(www.ncbinlm-nih.gov). The coding regions for these known genes may be amplified and/or 
expressed using the techniques disclosed herein or as would be know to those of ordinary skill in 
the art. Alternatively, various commercial preparations of proteins, polypeptides and peptides 
are known to those of skill in the art. 

In certain embodiments a proteinaceous compound may be purified. Generally, 
"purified" will refer to a specific protein, polypeptide, or peptide composition that has been 
subjected to fractionation to remove various other proteins, polypeptides, or peptides, and which 
composition substantially retains its activity, as may be assessed, for example, by the protein 
assays, as would be known to one of ordinary skill in the art for the specific or desired protein, 
polypeptide or peptide. In still further embodiments, a proteinaceous compound may be ^purified 
to allow it to retain its native or non-denatured conformation. Such compounds may be 
recombinantly derived or they may be purified from endogenous sources. 

In certain embodiments, the proteinaceous composition may comprise at least one 
antigen of a flaviviral envelope protein domain HI that is recognized by an antibody. As used 
herein, the term "antibody" is intended to refer broadly to any immunologic binding agent such 
as IgG, IgM, IgA, IgD and IgE. Generally, IgG and/or IgM are preferred because they are the 
most common antibodies in the physiological situation and because they are most easily made in 
a laboratory setting. 

The term "antibody" is also used to refer to any antibody-like molecule thazt has an 
antigen binding region, and includes antibody fragments such as Fab', Fab, F(ab> 2 , single 
domain antibodies (DABs), Fv, scFv (single chain Fv), and the like. The techniques for 
preparing and using various antibody-based constructs and fragments are well known in the art. 
Means for preparing and characterizing antibodies are also well known in the art (See, e.g., 
Harlow et al. , 1988; incorporated herein by reference). 

It is contemplated that virtually any protein, polypeptide or peptide containing 
component may be used in the compositions and methods disclosed herein. However, it is 
preferred that the proteinaceous material is biocompatible. In certain embodiments, it is 
envisioned that the formation of a more viscous composition will be advantageous in Chat it will 
allow the composition to be more precisely or easily applied to the tissue and to be maintained in 
contact with, the tissue throughout the procedure. In such cases, the use of ^ peptide 
composition, or more preferably, a polypeptide or protein composition, is contemplated. Ranges 
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of viscosity include, but are not limited to, about 40 to about 100 poise. In certain aspects, a 
viscosity of about 80 to about 100 poise is preferred. 

1. Variants of Flavivirus Envelope Protein Domain III Antigens 
Amino acid sequence variants of the polypeptides- of the present invention can be 
substitutional, insertional or deletion variants. Deletion variants lack one or more residues of the 
native protein that are not essential for function or immunogenic activity, and are exemplified by 
the variants lacking a transmembrane sequence described above. Another common type of 
deletion variant is one lacking secretory signal sequences or signal sequences directing: a protein 
to bind to a particular part of a cell. Insertional mutants typically involve the addition o-f material 
at a non-terminal point in the polypeptide. This may include the insertion of an immunoreactive 
epitope or simply a single residue. Terminal additions, called fusion proteins, are discussed 
below. 

Substitutional variants typically contain the exchange of one amino acid for smother at 
one or more sites within the protein, and may be designed to modulate one or more properties of 
the polypeptide, such as stability against proteolytic cleavag;e, without the loss of otherr functions 
or properties. Substitutions of this kind preferably are conservative, that is, one amino acid is 
replaced with one of similar shape and charge. Conservative substitutions are well known in the 
art and include, for example, the changes of: alanine to serine; arginine to lysine; asparagine to 
glutamine or histidine; aspartate to glutamate; cysteine to serine; glutamine to asp>aragine or 
histidine; glutamate to aspartate; glycine to proline; histidine to asparagine or glutamine; 
isoleucine to leucine or valine; leucine to valine or isoleucine; lysine to arginine; metliionine to 
leucine or isoleucine; phenylalanine to tyrosine, leucine or methionine; serine to threonine; 
threonine to serine; tryptophan to tyrosine; tyrosine to tryptophan or phenylalanine; and valine to 
isoleucine or leucine. 

The term "functionally equivalent codon" is used therein to refer to codons that encode 
the same amino acid, such as the six codons for arginine or serine, and also refers to codons that 
encode biologically equivalent amino acids (see Table 2, below). 

It also will be understood that amino acid and nucleic acid sequences may include 
additional residues, such as additional N- or C-terminal amino acids or 5' or 3' sequences, and yet 
still be essentially as set forth in one of the sequences disclosed herein, so long as tfcie sequence 
meets the criteria set forth above, including the maintenance of immunogenicity or antibody 
binding. The addition of terminal sequences particularly applies to nucleic acid sequences that 
may, for example, include various non-coding sequences flanking either of the 5' o:r 3' portions 
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of the coding region or may include various internal sequences, i.e., introns, which are known to 
occur within genes. 



TABLE 2 
Codon Table 



Amino Acids 


Codons 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


uuu 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


he 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


ccc 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


ucc 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GTJG 


GUU 






Tryptophan 


Tip 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











The following is a discussion based upon changing of the amino acids of a protein to 
create an equivalent, or even an improved, second-generation molecule. For example, certain 
amino acids may be substituted for other amino acids in a protein structure without appreciable 
loss of interactive binding capacity with structures suchL as, for example, antigen-"binding regions 
of antibodies. Since it is the interactive capacity and nature of a protein tfcxat defines that 
protein's biological functional activity, certain amino acid substitutions can be made in a protein 
sequence, and in its underlying DNA coding sequence^ and nevertheless produce a protein with 
like pxoperties. It is thus contemplated by the inventors that various changes may be made in the 
DNA sequences of genes without appreciable loss of their biological utility- or activity, as 
discussed below. Table 2, above, shows the codons that encode particular amino acids. 

ha making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1982). It is a.ccepted that the 
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relative hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other molecules, for 
example, enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. 

It also is understood in the art that the substitution of like amino acids can be made 
effectively on the basis of tiydrophilicity. U.S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as go-verned by the 
hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. 
As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been assigned to 
amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); 
serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 
1); alanine (-0.5); histidine *-0.5); cysteine (-1.0); methuonine (-1.3); valine (-1.5); leucine (-1.8); 
isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity value and still produce a biologically equivalent and/or an immunologically 
equivalent protein. In such changes, the substitution of amino acids whose hydrophilicity values 
are within ±2 is preferred, those that are within ±1 are particularly preferred, arxd those within 
±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions generally are based on the relative similarity 
of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity,, 
charge, and size. Exemplary substitutions that take into consideration the various foregoing 
characteristics are well known to those of skill in the art and include: arginine and lysine; 
glutamate and aspartate; serine and threonine; glutamime and asparagine; and valine, leucine and 
isoleucine. 

Another embodiment for the preparation of polypeptides according to the invention is the 
use of peptide mimetics. Mirnetics are peptide-containing molecules that rnirnic elements of protein 
secondary structure. See e.g., Johnson (1993). The underlying rationale behind thte use of peptide 
mimetics is that the peptide backbone of proteins exists chiefly to orient amino acid side chains in 
such a way as to facilitate molecular interactions, such as those of antibody and antigen. A peptide 
mimetic is expected to permit molecular interactions similar to the natural molecule. These 
principles may be used, in conjunction with the principles outlined above, to engineer second 
generation molecules having many of the properties of flavivirus envelope protein domain UJ 
antigens, but with altered and even improved characteristics. 
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2. Fusion Proteins 

A specialized kind of insertion^ variant is the fusion protein. This molecule generally has 
all or a substantial portion of the native molecule, linked at the N- or C-tenrhhus, to all or a portion 
of a second polypeptide. For example, fusions typically employ leader sequences from other- 
species to permit the recombinant expression of a protein in a heterologous host. Another useful 
fusion includes the addition of a region to f acilitate purification of ttie fusion protein. Inclusior* of a 
cleavage site at or near the fusion junction "will facilitate removal of the extraneous polypeptide after 
purification. Other useful fusions include linking of functional domains, such as active sites from 
enzymes such as a hydrolase, glycosylation domains, cellular targeting signals or transmembrane 
regions. 

3. Protein Purification 

It is desirable to purify flavivirus envelope protein domaiix III antigens or variants thereof. 
These techniques involve, at one level, the crude fractionation of the cellular milieu to 
polypeptide and non-polypeptide fractions. Certain embodiments of the invention are directed at 
preserving the conformation of flavivirus envelope protein domain HI antigens as much as 
possible so that they are substantially non-denatured. 

Antigens of the invention may be purified using gentle, n_on-denaturing detergents, xvhich 
include, but are not limited to, NP40 and digitonin. Infected or transfected host cells irtay be 
solubilized using a gentle detergent. The following conditions are considered "substantially 
denaturing" or "denaturing": 10 mM CHAPS, 0.5% SDS„ >2% deoxycholate, or 2.0% 
octylglucoside. Antigens prepared under such conditions would not be considered "non- 
denatured antigens." Preparations of substantially non-denatured antigens of the invention! may 
be accomplished using techniques described in U.S. Patents 6,0^74,646 and 5,587,285, which are 
hereby incorporated by reference herein.. 

Certain aspects of the present invention concern thes purification, and in particular 
embodiments, the substantial purification, of an encoded protein or peptide. The term "p-urified 
protein" or "purified peptide" as used herein, is intended to refer to a composition, iso latable 
from other components, wherein the protein or peptide is purLfied to any degree relative to its 
naturally-obtainable state. A purified protein or peptide therefore also refers to a pro-tein or 
peptide, free from the environment in which it may naturally oc«cur. 

Generally, "purified" will refer to a protein or pep-tide composition that ha^s been 
subjected to fractionation to remove various other components, and which composition 
substantially retains its expressed biological activity. Where tfcie term "substantially purified" is 
used, this designation will refer to a composition in which the protein or peptide forms the major 

21 



WO 2004/016586 



PCT/US2003/025681 



component of th.e composition, such as constituting about 50^>, about 60%, about 70%, about 
80%, about 90%, about 95% or more of the proteins in the composition. 

Various methods for quantifying the degree of purification of the protein or peptide will 
be known to those of skill in the art in light of the present disclosure. These include, for 
example, determining the specific activity of an active fraction, or assessing the amount of 
polypeptides within a fraction by SDS/PAGE analysis. A preferred method for assessing the 
purity of a fraction is to calculate the specific activity of the fraction, to compare it to the specific 
activity of the initial extract, and to thus calculate the degree of purity, herein assessed by a 
fold purification number." The actual units used to represent the amount of activity will, of 
course, be dependent upon the particular assay technique chosen to follow the purification and 
whether or not the expressed protein or peptide exhibits a dete&table activity. 

There is no general requirement that the protein or peptide always be provided in their 
most purified state. Indeed, it is contemplated that less substantially purified products will have 
utility in certain embodiments. Partial purification may be accomplished by itsing fewer 
purification steps in combination, or by utilizing different forms of the same general purification 
scheme. Methods exhibiting a lower degree of relative purific ation may have advantages in total 
recovery of protein product, or in maintaining the activity of axi expressed protein. 
4. Antibodies 

The present invention concerns the detection of flav-ivirus, TBE serocomplex virus or 
West Nile virus antibodies using flavivirus, TBE virus or West Nile virus antigens. As used 
herein, the term "antibody" is intended to refer broadly to airy immunologic binding agent such 
as IgG, IgM, IgA, IgD and IgE. Generally, IgG and/or IgM are preferred because they are the 
most common, antibodies in the physiological situation and because they are most ea_sily made in 
a laboratory setting. As described earlier, an antigen may include one or more epitopes and an 
antigen refers to any part of a polypeptide that contains at least one epitope. 

The term "antibody" is used, to refer to any antibody-like molecule that ha.s an antigen 
binding region. The techniques for preparing and using various antibody-based constructs and 
fragments are well known in the art. Means for preparing and characterizing antibodies are also 
well known in the art {See, e.g. , Harlow and Lane, 1988; incorporated herein by reference). 

In addition to polypeptides, antigens of the invention may be peptides corresponding to 
one or more antigenic deterrninants of the flavivirus envelop e protein domain HI polypeptides of 
the present invention. Thus, it is contemplated that detection of a flavivirus, a TBE virus or 
West Nile virus antibody may be accomplished with a fla"vivirus envelope protein domain III 
antigen that is a peptide or polypeptide. 
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Such peptides should generally be at least five or six amino acid residues in length and 
will preferably be about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25 or about 30 amino acid 
residues in length, and may contain up to about 35, 40, 45, 50, 55, 60, 65, 70, 75, 8a, 85, 90, 95 
100, 105, 1 10, 1 1 1 or more residues and values there between. For example, these peptides may 
comprise a AVN DIE antigen sequence, such as 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, IS, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 110 or more contiguous amino acids from any of SEQ 
ID NO:3 or 11; or a TBE-DHI antigen, such as 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, IS, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 1 10 or more contiguous amino acids from any of SEQ 
ID N0:14-20. Synthetic peptides will generally be about 35 residues long, which is the 
approximate upper length limit of automated peptide synthesis machines, such as those available 
from Applied Biosystems (Foster City, CA). Longer peptides also may be prepared, e.g., by 
recombinant means. 

U.S. Patent 4,554,101, incorporated herein by refexence, teaches the identification and 
preparation of epitopes from primary amino acid sequences on the basis of h^drophilicity. 
Through the methods disclosed, one of skill in the art would be able to identify epitopes and/or 
antigens from within an amino acid sequence such as a flavivirus, TBE virus or West Nile virus 
sequence disclosed herein in as SEQ ID NO:2, 3, 4, 5, 6, T, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18,19,20,21. 

Numerous scientific publications have also been devoted to the prediction of secondary 
structure, and to the identification of epitopes, from analyses of amino acid sequences (Chou and 
Fasman, 1974a, b; 1978a, b; 197S>). Any of these may be used, if desired, to supplement the 
teachings of Hopp in U.S. Patent 4^554,101. 

Moreover, computer programs are currently available to assist with predicting antigenic 
portions and epitopic core regions of proteins. Examples include those programs based upon the 
Jameson- Wolf analysis (Jameson and Wolf, 1988; Wolf «f ah, 1988), the program PepPlot® 
(Brutlag et aL, 1990; Weinberger et ah, 1985), and other new programs for protein tertiary 
structure prediction (Fetrow and Bryant, 1993). Anotner commercially available software 
program capable of carrying out sixch analyses is MacVector (JJ3I, New Haven, CT) . 

In further embodiments, major antigenic determinants of flavivirus, TBE or West Nile 
envelope protein domain III polypeptide may be identified by an empirical approach in which 
portions of the gene encoding a flavivirus, TBE or West Nile envelope protein(s) are expressed 
in a recombinant host, and the resulting proteins tested for their ability to elicit an immune 
response. Alternatively all or part of flavivirus envelope proteins from different subtypes or 
clades of different flaviviruses may be tested. A range of peptides lacking successively longer 



23 



WO 2004/016586 



PCT/US2003/025681 



fragments of the C-terminus of the protein can be assayed as long as the peptides are prepared to 
retain their structure as it would he in a native polypeptide. The irnmunoactivity of each of these 
peptides is determined to identify those fragments or domains of the polypeptide that are 
immuno dominant. Further studies in which only a small number of amino acids are removed at 
each iteration then allows the location of the antigenic determinants of the polypeptide to be 
more precisely determined. 

Once one or more such analyses are completed, polypeptides are prepared that contain at 
least the essential features of one or more antigenic determinants. The peptides are then 
employed in the generation of antisera against the polypeptide. Minigenes or gene fusions 
encoding these determinants also can be constructed arxd inserted into expression vectors by 
standard methods, for example, using PCR™ cloning mettiodology. 
5. Immunodetection Methods 

As discussed, in some embodiments, the present invention concerns irrununodetection 
methods for binding, pmifying, removing, quantifying and/or otherwise detecting flavivirus 
antibodies in a sample, particularly TBE virus or West Nile virus antibodies, using Dm antigens. 
The samples may be any biological fluid or tissue from a. patient or subject or animal host. The 
sample may be placed on a non-reactive surface such as a plate, slide, tube, or- other structure 
that facilitates in any way the screening of the sample for flavivirus antibodies. While samples 
may be individually screened, large numbers of samples may be screened, such as for detecting 
contamination in blood bank samples. 

Immunodetection methods include enzyme linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), irrununoradiometric assay, fluoroirnmunoassay, chemihuninescent 
assay, bioluminescent assay, and Western blot, though several others are well known to those of 
ordinary skill. The steps of various useful immunodetection methods have been described in the 
scientific literature, such as, e.g., Doolittle et al, 1999; Gulbis et al, 1993; De Jager et al, 1993; 
and Nakamura et al, 1987, each incorporated herein by reference. 

In general, the imnaunobinding methods include obtaining a sample suspected of 
containing a flavivirus, in particular a TBE virus or a West Nile virus antibody with a 
composition comprising a fla-vivirus, TBE virus or West Nile Dili antigen in accordance with the 
present invention under conditions effective to allow the formation of immunocomplexes. 

These methods include methods for purifying an antibody from bodily fluids, tissue or 
organismal samples. In these instances, the antigen removes the antibody component from a 
sample. The antigen will preferably be linked to a solid support, such as in the form of a column 
matrix, and the sample suspected of containing the antibody will be applied to the irnmobihzed 
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antigen. The unwanted components will be washed from the column, leading the antibody 
immunocomplexed to the immobilized antigen to be eluted. Alternatively, sandwich versions of 
this assay maybe employed. 

The immunobinding methods also include methods for detecting and quantifying the 
amount of an antibody component in a sample and the detection and quantification of any 
immune complexes formed during the binding process. Here, one would obtain a sample 
suspected of containing an axitibody and contact the sample with an antigen, and then detect and 
quantify the amount of immune complexes formed under the specific conditions. 

In terms of antigen detection, the biological sample analyzed may be any sample that is 
suspected of containing an antibody, such as, for example, a tissue section or specimen, a 
homogenized tissue extract, a cell, an organelle, separated and/or purified forms of any of the 
above antibody-containing compositions, or even any biological fluid that comes into contact 
with the cell or tissue, including blood and/or serum.. 

Contacting the chosen biological sample witbt the antigen under effective conditions and 
for a period of time sufficient to allow the formation of immune complexes (primary immune 
complexes) is generally a matter of simply adding the antigen composition*, to the sample and 
incubating the mixture for a period of time long enough for any antibodies present to form 
imm-une complexes with, i.e., to bind to, antigea.s . After this time, the sample-antibody 
composition, such as a tissue section, ELISA plate, dot blot or western blot, will generally be 
washed to remove any non-specifically bound antibody species, allowing only those antibodies 
specifically bound within title primary immune complexes to be detected. 

In general, the detection of immunocomplex formation is well known in the art and may 
be achieved through the application of numerous approaches. These methods are generally 
based upon the detection of a label or marker, such as any of those radioactive, fluorescent, 
biological and enzymatic tags. U.S. Patents concerning the use of such labels include 3,817,837; 
3,850,752; 3,939,350; 3,99 6,345; 4,277,437; 4,275,149 and 4,366,241, each, incorporated herein 
by reference. Of course, one may find additional advantages through the use of a secondary 
binding ligand such as a second antibody and/or a biotin/avidin ligand binding arrangement, as is 
known in the art. 

The antigen employed in the detection may itself be linked to a detectable label, wherein 
one would then simply detect this label, thereby allowing the amount of the primary immune 
complexes in the composition to be deterrnined. Alternatively, the first antigen that becomes 
bound within the primary immune complexes may be detected by means of a second binding 
ligand that has binding affinity for the antigen. In these cases, the second binding ligand may be 
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linked to a detectable label. The second binding ligand is itself often an antibody, which may 
thus be termed a "secondary" antibody. The primary immune complexes are contacted with the 
labeled, secondary binding ligand, or antibody, under effective conditions and for a period of 
time sufficient to allow the formation of secondary immune complexes. The secondary immune 
complexes are then generally washed to remove any non-specifically bound labeled secondary 
antibodies or ligands, and the remaining label in the secondary immune complexes is then 
detected. 

Further methods include the detection of primary immune complexes by a two step 
approach. A second binding ligand, such as an antibody, that has binding affinity for the 
antibody is used to form secondary immune complexes, as described above. After washing, the 
secondary immune complexes are contacted with a. third binding ligand or" antibody that has 
binding affinity for the second antibody, again under effective conditions and for a period of 
time sufficient to allow the formation of immune complexes (tertiary immurxe complexes). The 
third ligand or antibody is linked to a detectable label, allowing detection of "the tertiary immune 
complexes thus formed. This system may provide for signal amplification if this is desired. 
a. ELISAs 

As detailed above, immunoassays, in their most simple and/or direct sense, are binding 
assays. Certain preferred immunoassays are the various types of enzyme linJked immunosorbent 
assays (ELISAs) and/or ra-dioimmunoassays (RIA.) known in the art. Incimunohistochemical 
detection using tissue sections is also particularly useful. However, it will be readily appreciated 
that detection is not limited to such techniques. Western blotting, dot blotting, FACS analyses, 
peptide arrays may also be -used to detect antigen/antibody interaction. 

Turning first to immunoassays, in their most simple and direct sense, preferred 
immunoassays of the invention include the various types of enzyme baked immunosorbent 
assays (ELISAs) known to the art. However, it will be readily appreciated that the utility of the 
E>III preparations described herein are not limited to such assays, and that other useful 
embodiments include RIAs and other non-enzyme linked antibody bmding assays or procedures. 

In some embodiments of the ELISA assay, flavivirus, TBE virus or West Nile virus 
envelope proteins or appropriate peptides incorporating Dili antigen sequences are immobilized 
onto a selected surface, preferably a surface exhibiting a protein affinity s~uch as the wells of a 
polystyrene microtiter plate. After washing to remove incompletely adsorbed material, one will 
desire to bind or coat a nonspecific protein such as bovine serum albrunin (BSA), casein, 
solutions of milk powder, gelatin, PVP, superblock, or horse albumin onto the well that is known 
to be antigenically neutral with regard to the test antisera. This allows for blocking of 
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nonspecific adsorption sites on the immobilizing surface and thus reduces the background caused 
by nonspecific binding of antisera onto the surface. Following an appropriate coating period (for 
example, 3 hours), the coated wells will be blocked with a suitable protein, such as bovine serum 
albumin (BSA), casein, solutions of milk powder, gelatin, PVP, superblocOc, or horse albumin, 
and rinsed several times (e.g., 4 or 5 times) with a suitable buffer, such as PBS. The wells of the 
plates may then be allowed to dry, or may instead be used while they are still wet. 

After binding of antigenic material to the well, coating with a non-reactive material to 
reduce background, and washing to remove unbound material, the inunobilizing surface is 
contacted with the antisera or clinical or biological extract to be tested in a manner conducive to 
immune complex (antigen/antibody) formation. Such conditions preferably include diluting the 
antisera with diluents such as BSA, bovine gamma globulin (BGG) and phosphate buffered 
saline (PBS)/Tween. These added agents also tend to assist in the reduction of nonspecific 
background. The layered antisera is then allowed to incubate for from 1 to 4 hours, at 
temperatures preferably on the order of 20° to 25°C. Following incubation, the antisera- 
contacted surface is washed so as to remove non-immunocomplexed material. A preferred 
washing procedure includes washing with a solution such as PBS/Tween, ox borate buffer. 

Following formation of specific immunocomplexes between the test sample and the 
bound antigen, and subsequent washing, the occurrence and even amount of immunocomplex 
formation may be determined by subjecting same to a second antibody having specificity for the 
first. Of course, in that the test sample will typically be of human origin, the second antibody 
will preferably be an antibody having specificity in general for human EgG, IgM or IgA. To 
provide a detecting means, the second antibody will preferably have an associated enzyme that 
will generate a color development upon incubating with an appropriate chromogenic substrate. 
Thus, for example, one will desire to contact and incubate the antisera-bound surface with a 
urease, alkaline phosphatase, or peroxidase-conjugated anti-human IgG for a period of time and 
under conditions which favor the development of immunocomplex formation (e.g., incubation 
for 2 hours at room temperature in a PBS-containing solution such as PBS-Tween). 

After incubation with the second enzyme-tagged antibody, and subsequent to washing to 
remove unbound material, the amount of label is quantified by incubation with a chromogenic 
substrate such as urea and bromocresol purple or 2,2'-azino-di-(3-ethylbenztMazoline-6-sulfonic 
acid (ABTS) and H 2 0 2 , in the case of peroxidase as the enzyme label. Quantification is then 
achieved by measuring the degree of color generation, e.g., using a visible spectra 
spectrophotometer. 
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In an exemplary embodiment, in each of the microtiter wells will be placed about 10 u.1 
of the test patient sample along with about 90 ul of reaction buffer (e.g., PBS with about l°/8 
digitonin or other mild protein solubilizing agent). Control wells of the EXISA plate will include 
normal sera (human sera without flaviviros antibody), and anti-flavivirus antibody .collected, 
from subjects. 

Irrespective of trie format employed, ELISAs have certain features in common, such as 
coating, incubating and binding, washing to remove non-specificalLy bound species, and 
detecting the bound immune complexes. These are described below. 

In coating a plate with either antigen ox antibody, one will generally incubate the wells o f 
the plate with a solution of the antigen or antibody, either overnight or for a specified period of 
hours. The wells of the plate will then be washed to remove incompletely adsorbed material. 
Any remaining available surfaces of the wells are then "coated" with a nonspecific protein that is 
antigenically neutral with regard to the test antisera. These include bovine serum albumiaa 
(BSA), casein or solutions of milk powder. The coating allows for blocking of nonspecific 
adsorption sites on the immobilizing surface and thus reduces the background caused b;y 
nonspecific binding of antisera onto the surface. 

In ELISAs, it is probably more customary to use a secondary or tertiary detection means 
rather than a direct procedure. Thus, after binding of a protein or antib ody to the well, coatin.g 
with a non-reactive material to reduce background, and washing to remove unbound material, tixe 
immobilizing surface is contacted with the biological sample to be tested under conditions 
effective to allow imrnune complex (antigen/antibody) formation. Detection of the immune 
complex then requires a labeled secondary binding ligand or antibody, and a secondary binding 
ligand or antibody in conjunction with a labeled tertiary antibody or a third binding ligand. 

"Under conditions effective to allow immune complex (antigen/antibody) formation." 
means that the conditions preferably include diluting the antigens and/or antibodies with 
solutions such as BSA, bovine gamma globulin (BGG) or phosphate buffered saline 
(PBS)/Tween. These added agents also tend to assist in the reduction of nonspecific 
background. 

The "suitable" conditions also mean that the incubation is at a temperature or for a period 
of time sufficient to allow effective binding. Incubation steps are typically from about 1 to 2 "to 
4 hours or so, at temperatures preferably on the order of 25°C to 2TC, or may be overnight at 
about 4°C or so. 

Following all incubation steps in am ELISA, the contacted surface is washed so as to 

remove non-complexed material. An example of a washing procedure includes washing witfct a 
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solution such as PBS/Tween, or borate buffer. Following the formation of specific immune 
complexes between the test sample and the originally bound material, and subsequent washing, 
the occurrence of even minute amounts of imrrxune complexes maybe determined. 

To provide a detecting means, the second or third antibody will have an associated label 
to allow detection. This may be an enzyme that will generate color development upon 
incubating with an appropriate chromogenic substrate. Thus, for example, one will desire to 
contact or incubate the first and second immurxe complex with a urease, glucose oxidase, alkaline 
phosphatase or hydrogen peroxidase-conjugated antibody for a period of time and under 
conditions that favor the development of further immune complex formation (e.g., incubation for 
2 hours at room temperature in a PBS-containing solution such as PBS-Tween). 

After incubation with the labeled antibody, and subsequent to washing to remove 
unbound material, the amount of label is quantified, e.g., by incubation with a chromogenic 
substrate such as urea, or bromocresol purple, or 2,2'-azino-di-(3-ethyl-"benzthiazoline-6-sulfonic 
acid (ABTS), or H 2 0 2 , in the case of peroxidase as the enzyme label. Quantification is then 
achieved by measuring the degree of color generated, e.g., using a visible spectra 
spectrophotometer. 

b. Assay Plates 

In some embodiments, the wells of th.e assay plates may first be coated with an anti-DUI, 
antiTBE-Din and/or axiti-WN-DHI antibody. This would immobilize DEI antigen to the plastic 
in the presence of a mild solubilizing buffer, such as from about 0.1% to about 10% digitoain 
(particularly about I'Vo digitonin). Such an approach is particularly efficacious in preparing 
assay plates with wells made of plastic. 

The assay plates in other embodiments of the invention comprise a multiplicity of 
microtiter wells, and in some embodiments, polystyrene microtiter wells. These wells would be 
coated with about 50O ng/well of the rDhl, XBE-rDHI or WN-rDHI antigen. 

c. Immunohistoc/temistry 

The antigens of the present invention may also be used in conjunction with both fresh- 
frozen and/or paraffin-embedded tissue blocks prepared for study by immunohistocheniisstry 
(EHC). Flavivirus, TBE virus and West Nile virus antibodies may be identified in this manner. 
The method of preparing tissue blocks from these particulate specimens has been successfully 
used in previous IHC studies of various prognostic factors, and/or is well known to those of sldll 
in the art (Brown et at , 1 990; Abbondanzo al. , 1 990; Alfred et al. , 1 990). 



29 



WO 2004/016586 



PCT/US2003/025681 



III. NUCLEIC ACID MOLECULES 

In some embodiments, the present invention concerns envelope protein domain HI 
antigens prepared from genomic or recombinant nucleic acids. Some of the teachings herrein 
pertain to the construction, manipulation, and use of nucleic acids -to produce a recombinant 
envelope protein domain IE antigen. 

A. Polynucleotides Encoding E protein domain III Envelope Antigens 

The present invention concerns polyirucleotides, isolatable from cells or viruses, that are 
free from cellular or viral genomic DNA or IRNA and are capable of* expressing all or part of a 
protein or polypeptide. The polynucleotide may encode a peptide or polypeptide containing all 
or part of an envelope protein domain IH amino acid sequence or may encode a peptide or 
polypeptide having am envelope protein domain IE antigen sequence. Recombinant proteins- can 
be purified from expressing cells to yield denatured or non-denatured proteins or peptides. 

As used herein, the term "DNA segment" refers to a DKA molecule that has Tbeen 
isolated free of total genomic DNA of a particular species or genomic RNA of a virus. 
Therefore, a DNA segment encoding a polypeptide refers to a DNA segment that contains wild- 
type, polymorphic, or mutant polypeptide-coding sequences yet is isolated away from, or 
purified free from, total viral RNA or, mammalian, or human genomic DNA. Included vrt&ua 
die term "DNA segment" are recombinant -vectors, including, for example, plasmids, cosrnids, 
phage, viruses, and the like. 

As used in this application, the term "envelope protein domain III (Dili) polynucleotide" 
refers to an envelope protein domain III polypeptide-encoding nucleic acid molecule that has been 
isolated free of total genomic nucleic acid. Therefore, a "polynucleotide encoding an env -elope 
protein domain III antigen" refers to a DNA. segment that contains all or part of envelope protein 
domain HI polypeptide-coding sequences isolated away from, or pxirified free from, totaL viral 
genomic nucleic acid. 

It also is contemplated that a particular polypeptide from a given species or strain irmay be 
represented by natural variants that have slightly different nucleic acid sequences but, 
nonetheless, encode the same protein (see a"bove). 

Similarly, a polynucleotide comprising an isolated or purified gene refers to a DNA 
segment including,, in certain aspects, regulatory sequences, isolated substantially away from 
other naturally occutrring genes or protein encoding sequences. In thus respect, the term "gene" is 
used for simplicity to refer to a functional protein, polypeptide, o>r peptide-encoding unit. As 
will be understood by those in the art, this functional term includes genomic sequences, <;DNA 
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sequences, RNA sequences and smaller engineered gexie segments that express, or may be 
adapted to express, proteins, polypeptides, domains, peptides, fusion proteins, and mutants. A 
nucleic acid encoding all or part of a native or modified, polypeptide may contain a contiguous 
nucleic acid sequence encoding all or a portion of such a polypeptide of the follo-Nving lengths: 
about lO, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 1 80, 190, 200, 
210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 
400, 41 0, 420, 430, 440, 441, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 5 50, 560, 570, 
580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 
770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, S80, 890, 900, 910, 920, 9>30, 940, 950, 
960, 970, 980, 990, 1000, 1010, 1020, 1030, 1040, 105O, 1060, 1070, 1080, 109O, 1095, 1100, 
1500, 2 000, 2500, 3000, 3500, 4000, 4500, 5000, 550O, 6000, 6500, 7000, 750O, 8000, 9000, 
10000, or more nucleotides, nucleosides, or base pairs., which may be contiguous nucleotides 
encoding any length of contiguous amino acids of SEQ DD NO:2, or any of SEQ It> NO:3-21. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode a Dm antigen polypeptide or 
peptide, such as all or part of E>HI, which includes within its amino acid sequence a contiguous 
amino acid sequence in accordance with, or essentially corresponding to a native polypeptide. 
Thus, an isolated DNA segment, or vector containing a DNA segment may encode, for example, 
a Dili antigen that is capable of binding to an anti-flavrviras antibody. The term "recombinant" 
may be used in conjunction with a polypeptide or the name of a specific polypeptide, and this 
generally refers to a polypeptide produced from a nucleic acid molecule that has been 
manipulated in vitro or that is the replicated product of such a molecule. 

Encompassed by certain embodiments of the present invention are DNA segments 
encoding relatively small peptides, such as, for example, a peptide comprising all or part of an 
envelope protein Dili antigen (including at least one epitope) of any subtype or clade of 
flavivirus. 

The nucleic acid segments used in the present invention, regardless of thie length of the 
coding sequence itself, may be combined with other nucleic acid sequences, suclh as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length, may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length may be employed, with the total 
length preferably being limited by the ease of preparation and use in the intended recombinant 
DNA protocol. 
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It is contemplated that the nucleic acid constructs of the present invention may encode 
full-length envelope protein from any flaevivirus or encode a truncated version of the 
polypeptide, for example a truncated envelope protein domain III polypeptide, such that the 
transcript of the coding region represents the truncated version. The trunc-ated transcript may 
then be translated into a truncated protein. Alternatively, a nucleic acid sequence may encode a 
full-length polypeptide sequence with additional heterologous coding sequences, for example to 
allow for purification of the polypeptide, transport, secretion, post-translational modification, or 
for therapeutic benefits such as targeting or efficacy. As discussed above, a tag or other 
heterologous polypeptide may be added to the modified polypeptide-encoding sequence, wherein 
"heterologous" refers to a polypeptide that is not the same as the modified polypeptide. 

In a non-hmiting example, one or more nucleic acid constructs may be prepared that 
include a contiguous stretch of nucleotides identical to or complementary to a particular gene, 
such as a envelope protein gene of a particular flavi^irus or subtype or strain of a flavivirus. A 
nucleic acid construct may be at least 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 
160, 170, 180, 190, 200, 250, 300, 400, 500, 600, 700, 800, 900, 1,000, 2,000, 3,000, 4,000, 
5.00O, 6,000, 7,000, 8,000, 9,000, 10,000 nucleotides in length, as well as constructs of greater 
size, up to and including chromosomal sizes (including all intermediate lengths and intermediate 
ranges), given the advent of nucleic acids constructs such as a yeast artificial chromosome are 
known to those of ordinary skill in the art. It will be readily understood that "intermediate 
lengths" and "intermediate ranges," as used herein, means any length or range including or 
between the quoted values (i.e., all integers including and between such valu.es). 

The DNA segments used in the present invention encompass immunologically or 
biologically functional equivalent modified polypeptides and peptides. Such sequences may 
arise as a consequence of codon redundancy and functional equivalency that are known to occur 
naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, 
functionally equivalent proteins or peptides may be created via the application of recombinant 
DN.A technology, in which changes in the protein structure may be engineered, based on 
considerations of the properties of the amino acids being exchanged. Changes designed by 
human may be introduced through the application of site-directed mutagenesis techniques, e.g., 
to introduce improvements to the antigenicity of the protein, to reduce toxicity effects of the 
protein in vivo to a subject given the protein, ox: to increase the efficacy of any treatment 
involving the protein. 

The sequence of a flavivirus envelope protien Din polypeptide will substantially 
correspond to a contiguous portion of that shown dn amino acids 292-4021 of SEQ ID NO:3 or 
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any of SEQ ID NO:4-21 and have relatively few amino acids that are not identical to, or an 
immunological or a biologically functional equivalent of, the amino acids shown in amino acids 
292-402 of SEQ ID N0:3 or any of SEQ ID NO:4-21. The term "immunologically functional 
equivalent" or "biologically functional equivalent" is well understood in th.e art and is further 
defined in detail herein to include an ability to bind or be recognized by a specific flavivirus 
antibody. 

Accordingly, sequences that have between about 70% and about 80%; or more 
preferably, between about 81% and about 90%; or even more preferably, between about 91% 
and. about 99%; of amino acids that are identical or functionally equivalent to the amino acids of 
SEQ ID NO:3-21 will be sequences that are "essentially as set forth in SEQ ED NO:3-21 ." 

In certain other embodiments, the invention concerns isolated DNA segments and 
recombinant vectors that include within their sequence a contiguous nucleic- acid sequence from 
that shown in SEQ ID NO:l. This definition is used in the same sense as described above and 
means that the nucleic acid sequence substantially" corresponds to a contiguous portion of that 
shown in SEQ ID N0:1 and has relatively few codons that are not identical, or functionally 
equivalent, to the codons of SEQ ID NO:l. The term "functionally equivalent codon" is used 
herein to refer to codons that encode the same amino acid, such as the six codons for arginine or 
serine, and also refers to codons that encode biologically equivalent amino acids. See Table 2 
above, which lists the codons preferred for use in humans, with the codons listed in decreasing 
order of preference from left to right in the table (Wada et at, 1990). Codon preferences for 
other organisms also are well known to those of skill in the art (Wada &t al, 1990, included 
herein in its entirety by reference). 

The various probes and primers designed around the nucleotide sequences of the present 
invention may be of any length. By assigning numeric values to a sequence^ for example, the first 
residue is 1, the second residue is 2, etc., an algorithm defining all primers cart be proposed: 
n to n + y 

where n is an integer from 1 to the last number of the sequence and y is the length of the primer 
minus one, where n + y does not exceed the last number of the sequence. Thus, for a 10-mer, the 
probes correspond to bases 1 to 10, 2 to 11, 3 to 12 ... and so on. For a 15-mer, the probes 
correspond to bases 1 to 15, 2 to 16, 3 to 17 ... and so on. For a 20-mer, the probes correspond to 
bases 1 to 20, 2 to 21, 3 to 22 ... and so on. 

It also will be understood that this invention is not limited to the particular nucleic acid 
encoding amino acid sequences of SEQ ID NO:2, or any of SEQ ID NO:3-21. Recombinant 
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vectors and isolated DNA segments may therefore variously include the envelope protein DEI 
antigen-coding regions themselves, coding regions bearing selected alterations or modifications 
in the basic coding region, or they may encode larger polypeptides thai; nevertheless include 
envelope protein DAT antigen-coding regions or may encode biologically functional equivalent 
proteins or peptides that have variant amino acids sequences. 
1. Vectors 

Native and modified polypeptides maybe encoded by a nucleic acid molecule comprised 
in a vector. The term "vector" is used to refer to a carrier nucleic acid molecule into which a 
nucleic acid sequence can be inserted for introduction into a cell where it can be replicated. A 
nucleic acid sequence can be "exogenous," which means that it is foreign to the cell into which 
the vector is being introduced or that the sequence is homologous to a sequence in the cell but in 
a position within the host cell nucleic acid in which the sequence is ordinarily not found. 
Vectors include plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant viruses), 
and artificial chromosomes (e.g., YACs). One of skill in the art would be well equipped to 
construct a vector throngh standard recombinant techniques, which are described in Sambrook et 
al, (2001) and Ausubel et al, 1996, both incorporated herein by reference. In addition to 
encoding a modified polypeptide such as modified envelope protein DDlI, a vector may encode 
non-modified polypeptide sequences such as a tag or targeting molecule. Useful vectors 
encoding such fusion proteins include pIN vectors (Enouye et al, 1985), vectors encoding a stretch 
of histidines, and pGEZX or pMAL vectors, for use in generating glutathione S-transferase (GST) or 
maltose bmding protein (MBP) soluble fusion proteins for later purification and separation or 
cleavage. A targeting molecule is one that directs the modified polypeptide to a particular organ, 
tissue, cell, or other location in a subject's body. 

The term "expression vector" refers to a vector containing a nucleic acid sequence coding 
for at least part of a gene product capable of being transcribed. In some cases, RNA molecules 
are then translated into a protein, polypeptide, or peptide. Expression vectors can contain a 
variety of "control sequences," which refer to nucleic acid sequences necessary for the 
transcription and possibly translation of an operably linked coding sequence in a particular host 
organism. In addition to control sequences that govern transcription and translation, vectors and 
expression vectors may contain nucleic acid sequences that serve other nmctions as well and are 
described infra. 

Vectors may include a "promoter," wihich is a control sequence that is a region of a 
nucleic acid sequence at which initiation and rate of transcription are controlled. It may contain 
genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase 
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and other transcription factors. The phrases "operatively positioned," "operatively linked," 
"under control," and "under transcriptional control" mean that a promoter is in a correct 
functional location and/or orientation in relation to a nucleic acid sequence to control 
transcriptional initiation and/or expression of that sequence. A promoter may or may not be usea 
in conjunction with an "enhancer," which refers to a cis-acting regulatory sequence involved in 
the transcriptional activation of a nucleic acid sequence. 

A specific initiation signal also mscy be required for efficient translation of coding 
sequences. These signals include the ATG tnitiation codon or adjacent sequences. Exogenous 
translational control signals, including the AXG initiation codon, may need to be provided. One 
of ordinary skill in the art would readily be capable of determining this and providing the 
necessary signals. It is well known that the initiation codon must be "in-frame" with the reading 
frame of the desired coding sequence to ensxire translation of the entire insert. The exogenous 
translational control signals and initiation codons can be enter ixatural or synthetic. Tke 
efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer 
elements. 

In certain embodiments of the invention, the use of internal rioosome entry sites (IRES) 
elements are used to create multigene, or polycistronic, messages. IRES elements are able to 
bypass the ribosome scanning model of 5'— methylated Cap dependent translation and begin 
translation at internal sites (Pelletier and Soxienberg, 1988). IRES elements from two members 
of the picornavirus family (polio and enceplialomyocarditis) have been described (Pelletier and 
Sonenberg, 1988), as well an IRES from a mammalian message (Macejak and Sarnow, 199L). 
IRES elements can. be linked to heterologous open reading frames. Multiple open reading 
frames can be transcribed together, each separated by an IRES, creating polycistronic messages. 
By virtue of the IRES element, each open reading frame is accessible to ribosomes for efficieait 
translation. Multiple genes can be efficiently expressed using a single promoter/enliancer to 
transcribe a single message (see U.S. Patent 5,925,565 and 5,935,819, herein incorporated by 
reference). 

The vectors or constructs of the present invention will generally comprise at least one 
termination signal. A "termination signal" or "terminator" is comprised of the DNA sequences 
involved in specific termination of an RNAv transcript by an RNA polymerase. Thus, in certain 
embodiments a termination signal that ends the production of an RNA transcript is contemplated. 
A terminator may be necessary in vivo to acliieve desirable message levels. 

In eukaryotic systems, the terminator region may also comprise specific DNA sequences 
that permit site-specific cleavage of the new transcript so as to expose a polyadenylation site. 
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This signals a specialized endogenous polymerase to add a stretch, of about 200 A i 
(polyA) to the 3' end of the transcript. RNTA molecules modified with this polyA tail appear- to 
more stable and are translated more efficiently. Thus, in other embodiments involving 
eukaryotes, it is preferred that that terminator comprises a signal for the cleavage of the RKJA, 
and it is more preferred that the terminator signal promotes polyadenylation of the message. Xhe 
terminator and/or polyadenylation site elements can serve to enhance message levels and/or to 
minimize read through from the cassette into other sequences. 

Terminators contemplated for use in the invention include any known terminator of 
transcription described herein or known to one of ordinary skill in the art, including but mot 
limited to, for example, the termination sequences of genes, such, as for example the bovine 
growth hormone terminator or viral termination sequences, such, as for example the SV40 
terminator. In certain embodiments, the termination signal may h»e a lack of transcribable or 
translatable sequence, such as due to a sequence truncation. 

In expression, particularly eukaryotic expression, one will typically include a 
polyadenylation signal to effect proper polyadenylation of the transcript. The nature of the 
polyadenylation signal is not believed to be crucial to the successful practice of the invention, 
and/or any such sequence may be employed. Prefened embodiments include the SV40 
polyadenylation signal and/or the bovine growth hormone polyadenylation signal, convenient 
and/or known to function well in various target cells. Polyadenylation may increase the stability 
of the transcript or may facilitate cytoplasmic transport. 

In order to propagate a vector in a host cell, it may contain one or more origins of 
replication sites (often termed "ori"), xvhich is a specific nucleic acid sequence at winch 
replication is initiated. Alternatively an autonomously replicating sequence (ARS) cara be 
employed if the host cell is yeast. 

2. Host Cells 

As used herein, the terms "cell," "cell line," and "cell culture" may be vised 
interchangeably. All of these terms also include their progeny, which is any and all subsequent 
generations. It is understood that all progeny may not be identical due to deliberate or 
inadvertent mutations. In the context of expressing a heterologous nucleic acid sequence, '=host 
cell" refers to aprokaryotic or eukaryotic cell, and it includes any transformable organism th*at is 
capable of replicating a vector and/or expressing a heterologous gene encoded by a vectorr. A 
host cell can, and lias been, used as a recipient for vectors. A host cell may be "transfected" or 
"transformed," which refers to a process by which exogenous nucleic acid, such as a modified 
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protein-encoding sequence, is transferred or introduced into the host cell. A transformec3. cell 
includes the primary subject cell and its pro geny. 

Host cells may be derived from prokaryotes or eukaryotes., including yeast cells, insect 
cells, and mammalian cells, depending upon whether the desired result is replication of the 
vector or expression of part or all of the vector-encoded nucleic acid sequences. Numerovts cell 
lines and cultures are available for use as a host cell, and they can be obtained through the 
American Type Culture Collection (ATCC), which is an organiza.tion that serves as an airchive 
for living cultures and genetic materials (www.atcc.org). An appropriate host can be deterrrhned 
by one of skill in the art based on the vector backbone and the desired result. A plasmid or 
cosmid, for example, can be introduced into a prokaryote host cell for replication of many 
vectors. Bacterial cells used as host cells for vector replication anaVor expression include DH5oc, 
JM109, and KC8, as well as a number of commercially available bacterial hosts such as SURE® 
Competent Cells and SOLOPACK™ Gold Cells (Stratagene® La Jolla, CA). Alternajtively, 
bacterial cells such as E. coli LE392 could be used as host cells for phage viruses. Appropriate 
yeast cells include Saccharomyces cerevisiae, Saccharomyces pom.be, and Pichia pastoris. 

Examples of eukaryotic host cells for replication and/or expression of a vector iaiclude 
Vero, HeLa, NH3T3, Jurkat, 293, COS, CHO, Saos, and PC12. Many host cells from various 
cell types and organisms are available and would be known to one of skill in the art. Similarly, a 
viral vector may be used in conjunction with either a eukaryotic or prokaryotic host cell, 
particularly one that is permissive for replication or expression of the vector. 

Some vectors may employ control sequences that allow it to be replicated and/or 
expressed in both prokaryotic and eukaryotic cells. One of skill in the art would further 
understand the conditions under which, to incubate all of the above described host cells to 
maintain them and to permit replication of a vector. Also understood and known are techniques 
and conditions that would allow large-scale production of vectoxs, as well as production of the 
nucleic acids encoded by vectors and their cognate polypeptides, proteins, or peptides. 
3. Expression Systems 

Numerous expression systems exist that comprise at least a part or all of the comp ositions 
discussed above. Prokaryote- and/or eukaryote-based systems can be employed for use -with the 
present invention to produce nucleic acid sequences, or their cognate polypeptides, proteins and 
peptides. Many such systems are commercially and widely available. 

The insect cell/baculovirus system can produce a high level of protein expression of a 
heterologous nucleic acid segment, such as described in U.S. Patent No. 5,871,986, 4,879,236, 
both herein incorporated by reference, and which can be bought, for example, under the name 
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MaxBac® 2.0 fiom Invitrogen® and BacPack™ Baculovirxjs Expression System From 
Clontech®. 

In addition to the disclosed expression systems of the invention, other examples of 
expression systems include Stratagene®'s Complete Control™ Inducible Mammalian 
Expression System, which involves a synthetic ecdysone-inducible receptor, or its pET 
Expression System, an E. coli expression system. Another example of an inducible expression 
system is available from Invitrogen®, which carries the T-Rbx™ (tetracycline-regulated 
expression) System, an inducible mammalian expression system that uses the full-ler*.gth CMV 
promoter. INVITROGEN® also provides a yeast expression system called the Pichia methanolica 
Expression System, which is designed for high-level production of recombinant proteins in the 
methylotrophic yeast Pichia methanoZica. One of skill in the art would know how to- express a 
vector, such as an expression construct, to produce a nucleic acid sequence or its cognate 
polypeptide, protein, or peptide. 

IV. KITS AND DIAGNOSTICS 

The exemplary studies described herein show that xDIII is an excellerrt tool for 
differentiating infections caused by TBE serogroup versus mosquito-borne flavivirvases. This 
reagent would be particularly useful in regions where tick-borne and/or mosquito-borne 
flaviviruses are endemic, such as Ajsia, Europe and North America as well as economically 
depressed countries as it is relatively simple and inexpensive to produce. 

The studies described herein extend and improve upon the use of recombinant flavivirus 
envelope protein Dili for the detection of TBE and/or "WN virus infection. Recombinant DEI 
derived from the WN virus was found to be very specific and highly sensitive for identifying 
infection in naturally infected primates. Embodiments of the invention use rDIII as a diagnostic 
reagent for detecting TBE serocomplex virus infections. Assays using rDITf specific 
homologous and heterologous antiserum demonstrated a ver;y high degree of sensitivity and 
specificity and tests using mouse hyperimmune serum supported these results. _A potential 
drawback of the rDIII-based diagnostic assay may be the inability to differentiate between the 
TBE serocomplex viruses. It is contemplated that the minimization of potential binding epitopes 
may be accomplished by using peptide based diagnostic assa;ys. Peptide based ass-ays may be 
used to produce a greater degree of specificity to differentiate: the TBE serocomplex of viruses 
immunologically. In other embodiments of the invention, the use of the rDIII-based ELISAs as a 
rapid preliminary test for TBE virus infection can be followed by further clinical an<I laboratory 
tests such as virus isolation or neutralization assays to conclusively identify the virus causing 
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disease. In certain embodiments, rDIII can be used in a "dipstick" format by cross-linking the C- 
terminus of the protein to a solid substrate. This format would allow complete exposure of all 
rDHI antibody epitopes to test sera. The rDIII is an extremely stable protein as was shown by 
retention of the structure of TBE rDHI in up to 4M urea, 2M guanidinium hydrochloride and at 
low pH. The physical properties of the rDIII would lend themselves to the use of the rDIII 
reagent in tuxfavorable environmental conditions such as extreme heat or cold, or after extended 
storage. Recombinant protein technology for making these diagnostics reagents, will also 
minimize the cost of diagnosis, which in turn will make the use of such reagents feasible in 
economically depressed countries. 

In yet another aspect of the invention, a kit is envisioned for anti-flavivirus, anti-TBE 
virus or anti-West Nile virus antibody detection. In some embodiments, the present invention 
contemplates a diagnostic kit for detecting anti-TBE or anti-West Nile virus antibodies and 
human TBE or West Nile virus infection. The kit comprises reagents capable of detecting the 
anti-TBE or anti-West Nile antibody immunoreactive with the native or recombinant Dili 
antigens described here. Reagents of the kit include at least one Dill antigen, such a_s all or part 
of a TBE Din and/or West Nile Dm, and any of the following: another DEI antigen, buffers, 
secondary antibodies or antigens, or detection reagents, or a. combination thereof. 

In some embodiments, the kit may also comprise a. suitable container means = which is a 
container that will not react with components of the kit, such as an eppendorf tub>e, an assay 
plate, a syringe, or a tube. In specific embodiments, the kit comprises an array or chip on which 
one or more Dill antigen(s) is placed or fixed, such as those described in Reneke &t al, 1998, 
which is herein incorporated by reference. 

hi other embodiments of the invention, in addition to comprising a DUT antigen, it 
comprises a secondary antibody capable of detecting the anti-flavivirus, anti-TBE virus or anti- 
West Nile virus antibody that is immunoreactive with the recombinant Din antigen. 

The flavivirus antigen reagent of the kit can be provided as a liquid solution, attached to a 
solid support or as a dried powder. Preferably, when the reagent is provided in a liquid solution, 
the liquid solution is an aqueous solution. Preferably, when the reagent provided is attached to a 
solid support, the solid support can be chromatograph media, peptide array plate, plastic beads or 
plates, or a microscope slide. When the reagent provided is a dry powder, the powder can be 
reconstituted by the addition of a suitable solvent. In yet other embodiments, the kit may further 
comprise a container means comprising an appropriate solvent. 

In some embodiments, the kit comprises a container means that includes a- volume of a 
second antibody, such as goat anti-human IgG or IgM conjugated with alkaline phosphatase or 
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other anti-human Ig secondary antibody, and a second container means that includes a volume of 
a buffer comprising a non-denaturing solubilizing agent, such as about 1% digitonin. 

The kit may in other embodiments further comprise a third container means that includes 
an appropriate substrate, such as PNPP for alkaline phosphatase, or 9-dianisidine for peroxidase. 
A fourth container means that includes an appropriate "stop" buffer, such as 0. 5 m NaOH, may 
also be included with various embodiments of the kit. 

The kit may further include an instruction sheet that outlines the procedural steps of the 
assay, and will follow substantially the same steps as the typical EIA format known to those of 
skill in the art. 

EXAMPLES 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill nx the art that the techniques disclosed in the 
examples which follow represent techniques discovejred by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1 
Materials and Methods 

Virus strains and antigens 

Strains of WN, JE, arid St. Louis encephalitis (SLE) viruses used in thus study are listed 
in Table 3. All viruses were obtained from the World Arbovirus Reference Collection at the 
University of Texas Medical Branch at Galveston QJTMB). The WN strains were chosen to 
represent subtypes of both genetic lineages I and II- genotypes of these viruses had previously 
been determined by sequencing of a region corresponding to the NS5/3'-non-coding region 
junction. The protocols for propagation and nucleoside sequencing of these viruses have been 
described elsewhere (Beaslev et al, 2002). 

Whole virus suckling mouse brain-derived antigen preparations for WN (strain 385-99), 
JE (strain Nakayama), SLE (strain Parsons) and MVE viruses were also obtained from the World 
Arbovirus Reference Collection. 
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TABLE 3. Origins and genotypes of West Nile virus strains. 



Strain 


Origin 


Year of 
Isolation 


Lineaee* 


Designation 


385-99 


United States 


1999 


I 


USA99b 


EthAn4766 


Ethiopia 


1976 


I 


ETH76 


TL443 


Israel 


1952 


I 


ISR52 


Goldblum 


Israel 


1953 


I 


ISR53 


MRM16 


Australia 


1960 


I (Kunjin) 


AUS60 


804994 


India 


1980 


I (Indian) 


IND80 


E>akAnMg798 


Madagascar 


1978 


II 


MAD78 


SPU1 16-89 


South Africa 


1989 


II 


SA89 


DakArMg-979 


Madagascar 


1988 


II 


MAD88 


H-442 


South Africa 


1958 


n 


SA58 



Recombinant WN strain 385-99 envelop e protein domain TTT 

A fragment corresponding to structural domain III of the WN virus strain 385-99 
envelope protein (amino acids 296-415) was RT-PCR amplified for cloning and expression as a 
glutathione S-transferase <GST) fusion using the pGEX-2T system (Axnersham Pharmacia 
Biotech, Piscataway NJ). Protocols for expression and purification of tlie WN recombinant 
structural domain III of the envelope protein GST fusion protein (rDIH GST), followed by 
cleavage of the fusion protein and purification of WN rDHI away from the* GST fusion partner, 
were based on those described by Bhardwaj et al. (2001). Briefly, RNA. was extracted from 
c-ulture supernatant of virus-infected Vero cells using the QiaAmp kit (Qiagen Inc., Valencia 
CA) and reverse transcribed using the AMV Reverse Transcriptase with random hexamer 
primers (Roche). Specific fragments representing envelope protein structutral domain HI with 5' 
and 3' restriction sites suitable for cloning were amplified using Taq polymerase (Roche). PCR 
products were gel purified, cloned into pGEM-TEasy (Promega Corp., Mladison WI), digested 
using the appropriate restriction enzymes and subcloned into appropriately digested pGEX-2T 
vector. Inserts were sequenced in both directions to ensure fidelity of the products. 
Recombinant expression plasmids were transformed into DH5a E. coli for propagation and 
protein expression. Following induction, the fusion protein was purified on a glutathione 
sepharose column, and tDIII was subsequently cleaved from GST using thrombin (Novagen, 
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Madison WI) and purified on a DEAE anion exchange column. Homogeneity of rDIII was 
confirmed by mass spectroscopy (data not shown). 

Antisera and monoclonal antibodies 

WN rDni expressed and purified using the GST system was sent to Harlan Byproducts 
for Science (Indianapolis, IN) to be used as an antigen for the preparation of a polyclonal rabbit 
serum.. The antiserum was prepared using Harlan's standard immunization protocol in New 
Zealand White Rabbits (details available at "www.hbps.com"). Three WN Envelope protein 
reactive MAbs (5H10, 5C5 and 7H2) were obtained from Bioreliance Cop. (Rockville MD). The 
binding of these MAbs to domain III, differences in ttieir specificities, and the identification of 
putative binding sites for 5C5 and 5H10 are described elsewhere (Beasley and Barrett, 2002). 
Additional polyclonal mouse hyper-immune ascitic fluids (HIAF) against WN, JTE, SLE, MVE, 
dengue type 2 (DEN2) and yellow fever (YF) viruses were obtained from the World Arbovirus 
Reference Collection. 

Plaque reduction neutralization tests (PRND 

Ten-fold dilutions of virus (10' 1 to 10" 6 ) were prepared in MEM tissue culture medium 
(Sigma) containing 2% fetal bovine serum (FBS) and mixed with equal volumes of anti-WN 
MAb or polyclonal anti-WN-nDIH serum, diluted 1/200 or 1/20 respectively, or MEM media 
only. Virus-antibody mixtures were incubated at room temperature for 60 minutes before 
inoculation into monolayers of Vero cells in 6-well tissue culture plates (Corning Inc., Coming 
NY). Plates were incubated at room temperature for 30 rninutes to allow virus adsorption, then 
overlayed with 5 mL per well of MEM medium containing 1% agarose (MEM/agarose). After 
incubation at 37°C/5% C0 2 for a suitable period (two or three days for WN virus strains; four or 
five days for JE/SLE viruses) wells were overlayed "with an additional 2 mL of MEM/agarose 
containing 2% v/v neutral red solution (Sigma, St. Louis MO). Plaques were counted the 
follo-wing day and neutralization indices determined as the logi 0 reduction in virus titer in the 
presence of MAb/polyclonal serum compared with the medium only control. 

Indirect ELISA assays 

The wells of 96-well rnicrotiter plates (Cornirag Inc.) were coated overnight at 4°C with 
either WN, JE, MVE, or SLE virus antigen (equivalent to one pH 6.2 HA unit), or WN-rDIII 
protein (25 ng/well), diluted in borate saline (pH 9.0). These optimal dilutions of whole virus 
and recombinant antigens had been determined previously by titration against specific antisera 
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(data not shown). Wells were blocked for 60 mitrutes with a solution of 3<¥o bovine serum 
albumin in phosphate buffered saline (PBS) containing 5% tween-20 (PBS/frween), and then 
washed with PBS/tween. Serial doubling dilutions <1:100 - 1:6400) of anti-WN, -JE, -SLE, - 
MVE., -DEN2 and -YF mouse HIAFs were prepared in duplicate columns, the plates were 
incubated at room temperature for 45 minutes, and then washed four times with PBS/tween. 
Peroxidase-labeled anti-mouse immunoglobulin serum (Sigma) diluted 1:2500 iix PBS/tween was 
added to each well, and plates were again incubated, washed (four times with PBS/tween, twice 
with I>BS) and antibody binding visualized by addition of TMB substrate (Sigma). After 
incubating for 10 minutes at room temperature, color reactions were stopped by addition of 3M 
HC1 and absorbances read at 490 nm on a Fluoromark plate reader (BioRad, Hercules CA). 

Nucleotide sequencing 

RNA was extracted from WN virus-infected Vero cell supernataots and reverse 
transcribed as described earlier. A fragment that included the structural domain III coding 
sequence was RT-PCR amplified "using primers WTN1751 (5'- 
i75iTGrCATCAAGCTTTGGCTGGA 1 77o)(SEQ ID. NO:22) and W1M2504A (5'- 
2504TCTTGCCGGCTGATGTCTAT 24 85)(SEQ IDNOz23) for lineage I strains, o>x WN1739 (5 '- 
i75iTGCACCAAGCTCTGGCCGGAi 7 7o)(SEQ ID NO:24) and WN2^198A (5 ,- 
25ioCGGAGCTCTTGCCTGCCAAT 24 9i)(SEQ ID NO:25) for hneage E strains. Primer pairs 
were designed based on Genbank sequences AF1 96835 and M12294 (each of which is 
incorporated herein by reference), respectively, and are numbered according to residues in the 
AF196835 sequence. PCR products of the appropriate sizes were gel purified and directly 
sequenced using the ABI PRISM Big Dye v3.0 cycle sequencing kit (Applied Biosystems) on an 
ABI PRISM 3100 genetic analyzer (Applied Biosystems) according to the manufacturer's 
protocols. Sequence analysis was performed using the Vector NTI Suite package (Informax 
Inc.). 

Results 

Specificity of Polyvalent Anti-WN Domain HI Serum 

To determine the specificity of polyvalent anti-domain III rabbit serum PTRNT assays and 
Western blot with related JE serocomplex and other mosquito-borne fla.viviruses were 
performed, fn PRNT assays, the anti-domain m seruun neutralized WN strain 385-99 by more 
than 5000-fold (Table 4), while less than 10-fold reductions in titre were observed in assays with 
JE, SUB, DEN or YF viruses. In Western blot assays with JE, MVE and SLIE virus antigen 
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preparations the inventors observed some weak cross-reactivity with the envelope proteins of 
those viruses (FIG. 3). ha other western blot analysis the WN domain EI specific monoclonal 
antibodies were characterized (FIG. 4). 

TABLE 4. Variable neutralization of West Nile vims strains representative of genetic lineages I 
and II by Envelope protein domain Hi-specific monoclonal antibodies and a 
polyclonal antisenrm 



NEUTRALIZATION INDEX* AGAINST WN VIRUS STRAINS 


Serum 


WN strain 


5H10 


7H2 


5C5 


Anti-r>H[ 


USA99b 


2.3 


3.6 


2.5 


3.8 


ETH76 


2.7 


4.2 


2.4 


3.9 


ISR52 


2.2 


3.4 


2.4 


3.9 


ISR53 


0.9 


2.1 


1.9 


3.9 


AUS60 


1.1 


1.6 


1.1 


2.0 


INDSO 


1.7 


2.6 


2.5 


>5.6 


MAD78 


2.5 


3.1 


2.5 


>4.8 


SA89 


1.3 


1.7 


1.2 


2.7 


MAD88 


0.2 


0.1 


-0.2 


0.3 


SA58 


0.2 


0.1 


0.1 


0.6 



♦neutralization index is logi 0 reduction in virus titre in the presence of Mab/polyclonal serum compared with culture 
medium only control 

Variable Neutralization of WN Virus Strains "by Anti-Domain HI Serum and MAbs 
Having observed the specificity of the anti-domain III serum for WIST virus in PRNT 
assays (FIG. 5), the inventors then tested whether this reagent could distinguish between 
subtypes of WN virus. In addition, the subtype specificity of the neutralizing d-omain III reactive 
MAbs was examined. Although differences in neutralization did not clearly delineate viruses of 
different genetic lineages, some variable neutralization of WN subtypes was observed (Table 4). 
In general, viruses of genetic lineage I were efficiently neutralized by both the; polyclonal serum 
and the MAbs (-500- to 5000-fold reductions in titre), although neutralization of strain AUS60 
(lineage I, Kunjin) was approximately 10 to 100-fold lower than that of othex lineage I strains. 
Similarly, strain ISR53 was less efficiently neutralized by the MAbs than other lineage I strains, 
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although this strain was still strongly neutralized by the polyclonal anti-ciomain in serum. 
Lineage II virus strain MAD78 was also strongly neutralized by MAbs and polyclonal serum, 
while strains MAD88 and SA58 completely escaped neutralization (less than 10-fold reductions 
in titer in the presence of either MAbs or serum). Neutralization of strain SAS9 was incomplete 
(10- to 100-fold reductions in. titer only) and was comparable to that of AUS60 . 

Correlation of Domain III Amino Acid Sequence with Neutralization Plhenotvp e 
Analysis of derived Envelope protein domain in amino acid sequences for each WN 
straijn studied allowed the identification of residues ttxat appeared to influence "their neutralization 
phenotype (FIG. 6). Strains USA99b and ETUI 6 were identical throughout the region 
examined, while other lineage I strains differed at only one (ISR52 and ISR53 ) or three (AUS60, 
INT) 80) residues. Strain ISR.53, which partially escaped neutralization by the MAbs but not the 
polyclonal serum (Table 4), contained a Thr-»Ala substitution at E332 (amino acid 332 of the 
enevelope protein). Strain. AUS60, which partially escaped neutralization by MAbs and 
antiserum, differed at residues E310 (Lys^Thr), E339 (Val-»Ile) and E366 (-Ala->Ser) although 
the substitution at E339 was also observed in strain IND80, which did not escape neutralization. 
Additional substitutions in TND80 were identified at E312(Leu-»Val) and E390(Glu->Asp). A 
His— »Tyr substitution atE398 of strain ISR52 did not affect the neutralization of this strain. The 
lineage II strains studied all differed from USA99b at between two and four residues in domain 
HI (FIG. 6). Strain SA89, which displayed partial escape from neutralization by MAbs and 
antiserum, contained the smallest number of substituLtions, with changes at E312 (Leu-»Ala) and 
E369 (Ala-»Ser). Strains MAD88 and SA58, whic-li escaped neutralization by MAbs and anti- 
domain in serum, shared the substitutions at E3 12 and E369, and contained an additional 
substitution at E332 (Thr-^Lys). Strain MAD78, which was efficiently neutralized by both 
MAbs and antiserum, contained the greatest number of variable amino acids. . This strain 
contained the E369 (Ala->Ser) substitution observed in the other lineage n strains examined, a 
Le-u->Val change at E312 (also present in 1ND80)^ and additional unique substitutions at E371 
(Val->Ile) andE375 (Leu-»lle). 

Comparison with representative amino acid, sequences of the comparable region of JE, 
SLE and MVE viruses revealed much greater variation, and substitutions were present at each of 
the critical residues for neutralization that were identified in the WN virus strains, and also at 
clusters of residues around these loci (FIG. 6). 
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Enhanced Specificity of WN r-DITI in Indirect Elisa Compared with Whole Virus 
Antig ens 

The apparent type-specificity of functional epitopes in domain HI (as evidenced by the 
limited neutralizing activity of the anti-domain Ed serum against other JK serocomplex viruses 
and some strains of WIST lineage H) led us to investigate the utility of rEDUt as an antigen for 
serological assays. Indirect ELISAs were performed using a panel of MIAF raised against 
several mosquito-borne flaviviruses (see Materials and Methods). 

In assays where plates were coated with whole virus antigens (inactivated WN, JE, MVE 
or SLE viruses) extensive cross-reactivity was observed with most MIAF antisera (FIG. 7). ha 
general, the strongest reactions were observed between specific antigen/arxtiserum combinations 
(>.£. anti-WN serum with WN antigen). However, in each case, as least two other antisera 
reacted to at least 7S°/o of the homologous seruxn at dilutions between 1 : 100 and 1:800. The 
binding activity of the anti-MVE MIAF was louver than the other JE sexocomplex antisera in 
each assay, however its cross-reactive binding to "WN, JE or SLE antigens xvas at least 60% of its 
"binding to the MVE antigen. 

In contrast, the binding of anti-WN MIAF to WN rDIU antigen cleaved from a MBP 
fusion was clearly discriminated from the other antisera; values at dilutions between 1 :200 and 1 
-.6400 were at least three-fold higher than those of sera raised against otiher flavivirus antigens 
(TIG. 7). The peak values obtained using the rDHJ antigen were approximately 75% of those 
with whole virus WN antigen indicating some loss of sensitivity, as would be expected with the 
removal of binding sites contained in the remainder of the envelope protein. 

Further studies have shown that WN rr>in antigen cleaved from a GST fusion protein 
yields greater specificity in indirect ELISA assays compared with whole virus antigen 
preparations (FIG. 8). Ninety-six-well ELISA plates were coated with sucrose-acetone extracted 
virus antigens (WN, JE, SLE or MVE equivalent to 4 HA units at pH6.2) or WN rDIII antigen. 
Serial dilutions of polyclonal mouse antisera raised against WN, JE, SLE, MVE, DEN or YF 
viruses were added to wells of plates (optimal antigen and antiserum dilutions had been, 
determined by block titration of homologous aixtigen(Ag)/antibody(Ab) pairs); 2° Ab was HRP* 
anti-mouse Ig; substrate was TMB. 

Additional studies showed that the use of cleaved, purified WIN rDJJI antigen yields 
greater specificity in indirect ELISA assays than use of purified MBP-DIEI fusion protein antigen* 
(FIG. 9). In brief, 96-well ELISA plates were coated with either (a) WN rDHI Ag (-15 ng/well> 
or WN rDIJJ as an MBP fusion (-35 ng/well and -1.75 ng/well total protein in (b) and (c> 
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respectively, which represents ~7 ng/well or 0.35 ng/well WN rDHI)- Assays were performed 
using serial dilutions of polyclonal mouse sera as described previously. Note greater cross- 
reactive (possibly non-specific) binding in p>anel (b). Further dilution of MBP rDHI fusion 
protein antigen reduces apparent cross reactivity but with marked reduction in sensitivity. 
EXAMPLE 2 

Materials and Methods 

Generation of recombinant Domain TIT: 

Recombinant domain m (rDHJ) protein was expressed in E. coli as a fusion protein using 
maltose-binding protein (MBP) as the fusion partner. Expression and purification wa_s 
essentially following the manufacturer's instructions and was previously described. Briefly, th_e 
coding sequence for domain m of the viral envelope protein was cloned into the pMAL-c2rx 
expression vector CNew England Biolabs). The individual Dili molecules encompassed 
approximately residues 300-395 of the viral envelope protein. Cloning into the pMAL system 
added an additional serine to the N-terminu-s of the recombinant proteins. The fusion protein 
was expressed by induction with JPTG. Purification was achieved via lysing the cells b»y 
sonication followed by affinity purificatiozn over an amylose resin column (New England 
Biolabs). The fusion protein was cleaved with Factor Xa (Novagen) and the MBP and rDTTT 
separated by size exclusion chromatography- on a Superdex 75 column (Amersharn/Pharmacia). 
Domain III was concentrated and stored at 4°C until use. The TBE rZDHI protein has been found 
to extremely stable under very stringent conditions (Bhardwaj et al. 2001, White et al, 200 3) 
and is stable when stored at 4°C for extended periods. 

Antiserum production: 

Purified rDIII was provided to Harlan Bioproducts for Science (Indianapolis, IN) for 
production of rabbit antisera. Antiserum against each rDHI protein, was produced in two New 
Zealand white rabbits. Testing of the antisera in ELISA and western blot assays found little 
difference between, antisera generated in different rabbits against the same antigen (M. Holbrook, 
unpublished observations). 

Antigens and mouse immune asc iti c fluids: 

Suckling mouse brain-derived viral antigens from dengue-2 (DEN2), dengue-4 (DE1ST4), 
yellow fever (YF) vaccine strain 17D, Japanese encephalitis (JE) strain Nakayama, Lairgat 
(LGT) strain TP21 and Powassan (POW) strain LB were obtained from the World Arbovirus 
Reference Collection housed at the University of Texas Medical Branch. In addition, momse 

47 



WO 2004/016586 



PCT/US2003/025681 



hyperimmune ascitic fluid (MIAF) against DEN2, DEN4, JE, YF, West Nile (WN), LGT, POW, 
KFD and RSSE were also obtained from the World Arbovirus Reference Collection. 

Western blots: 

Ten nanograms (ng) of purified rDHI was run on 12% SDS-PAGE gels and transferred to 
a nitrocellulose membrane for blotting. The blots were blocked with TBS-tween (20 mM Trds, 
pH 7.5, 150 mM NaCl, 0.05% tween 20) containing 3%. dry milk powder (Blotto) for at least 30 
min. at room temperature. The membranes were probed for 1 hr at room temperature with the 
appropriate antiserum diluted in Blotto at dilutions of 1:800-1:1000 dependent upon the 
antiserum. Blots were washed 3 times with Blotto and probed with a goat anti-rabbit-horseradish 
peroxidase (HRP) conjugated secondary antibody (Sigma) at a 1:2000 dilution in Blotto for 1 hr 
at room temperature. The blots were subsequently washed twice with Blotto and three times 
with TBS-tween. The presence of rDIIT was detected using the ECL chemiluminescence 
substrate (Amersham/Pharmacia). 

Indirect ELISAs: 

Purified rDIII or mouse brain-derived viral antigen (Ag) was coated onto 96-well round 
bottom microtiter plates (Falcon) overnight at 4°C in borate saline fcuffer (120 mM NaCl, 50 cnM 
boric acid, pH 9.0). Preliminary experiments examining sensitivity of the assay found that wells 
coated with 10-2O ng of rDIII provided optimum sensitivity while Ag was coated in plates at 1 
hemagglutination (HA) unit per well. Wells were blocked with. PBS-tween (PBS with 0-5% 
tween-20) containing 3% bovine serum albumin (BSA) for 30 min. at room temperature then 
washed once with PBS-tween prior to incubation with antisera. Two-fold serial dilutions of 
antisera were made in duplicate wells. All dilutions were made hx PBS-tween. Following a 1 hr 
room temperature incubation with primary antibody, the plates -were washed with PBS-tween 
and then incubated with either goat anti-mouse or goat anti-rabbit HRP conjugated secondary 
antibody at a 1:2000 dilution for 1 hr at room temperature. The plates were washed and then 
incubated with 50 ul 3,3',5,5'-Tetrametraylbenzidine (TMB) (Sigma) colorometric detection 
reagent for 5 min at room temperature. The reaction was stopped with 50 ul 3M HC1 anc3 the 
plates were read at 450 nm with a reference wavelength of 595 run. 
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Results 

Cloning of viral Dili: 

The rDHI used in these assays were cloned from viruses representing several mosquito- 
borne flaviviruses and the major clades of the TBE serocomplex with the exception oF the 
Siberian and Far-eastern subtypes of viruses (FIG. 10). Viral RUNfA for the Siberian and Far- 
eastern subtypes was not available as they are BSL-4 agents with restricted availability. 
Kumlinge (KUM) virus is a strain of CKE while OHF and KFD viruses are viruses that & ause 
hemorrhagic fever rather than an exclusively encephalitic disease and form distinct subgroups 
wimin the serocomplex. LGT and POW viruses also represent distinct subgroups of the TBE 
serocomplex (FIG. 10). LGT is a naturally attenuated virus originally isolated in Malaysia, and 
POW may represent an older lineage of TBE viruses in North America and Asia (Gould e?t al, 
2001, Zanotto et al, 1995). In addition to members of the TBE serocomplex, rDHI fronci the 
mosquito-borne WN, YF vaccine strain 17D and YF wild-type strain Asibi were also produced. 
The amino acid sequence within the Dm of all flaviviruses is similar, but the level of identity 
within the TBE serocomplex is quite high (FIG. 16). This high degree of similarity makes -these 
viruses difficult to distinguish serologically. 

Western, blots: 

Purified rDHI derived from several mosquito- and tick-borne flaviviruses were run on 
SDS-PAGE gels and transferred to nitrocellulose for blotting witib homologous and heterologous 
rabbit anti-rDIEI specific antiserum. These assays found a significant degree of cross-reactivity 
between rDHI derived from members of the tick-borne flavivims serocomplex (FIG. 11). All 
five TBE serocomplex antisera recognized the five TBE serocomplex rDHI, though the sera 
tended to cross-react less well with LGT rDIH, and the rabbit anti-POW rDHI antiserum 
appeared to have less cross-reactivity tttan other sera. This result is not surprising as LGT and 
POW viruses are phylogenetically less related than KUM, OHF and KFD viruses (FIGr. 10). 
None of the rahbit anti-TBE serocomplex antisera recognized rDTH derived from the mosquito- 
borne flaviviruses WN or YF, nor did rabbit anti-YF or anti-WN antisera recognize any of the 
TBE rDIH (FIG. 11). 

ViraL axitigen based ELISAs: 

Mouse brain-derived viral antigens were coated in 96-well plates at one hemagglutination 
(HA) unit per well. Din specific sera and MIAF were diluted, at two-fold serial dilutioms and 
sensitivity and specificity of the assay determined. As seert in FIG. 12 there is a l^ack of 
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specificity for TBE serogroup viral antigens using MIAF. MCIAF generated against tick-borne 
flaviviruses are shown in open symbols while the remaining symbols comprise mosquito-borne 
flaviviruses. In all assays JE MIAF cross-reacted strongly with all of the antigens tested. The 
assay that demonstrated clear specificity was that against JE mouse-derived antigen where the JE 
MIAF clearly reacted well with the antigen. In the remaining panels, little specificity was found 
for MIAF binding to mouse-brain derived viral antigen clearly demonstrating that this antigen is 
not suitable for a diagnostic assay. In these experiments, the 3MIAF were not normalized against 
homologous rDIH or virus-derived antigens prior to performirtg the studies. Instead, tfcie MIAF 
were tested as received from the World Arbovirus Reference Collection. Due to the lack of 
availability of sera from natural infections, this method was undertaken to mimic the testing of a 
potentially infected individual in a true diagnostic setting. In some cases, such as is apparent 
with JE virus MIAF, the reactive antibody titer may be higher than other MIAF and give a higher 
level of cross-reactivity. Normalization of the MIAF might reduce the cross-reactivity, but it 
would also bias the study. 

In similar studies using raobit anti-rDHI specific antiserum to screen against virus- 
derived antigen, cross-reactivity was also observed. As seen in FIG. 13, though the degree of 
cross-reactivity is not as great as was seen in FIG. 12, both rabbit rDIJJ antiserum specific for the 
Dili of LGT and WN viruses reacted with several viraL antigens. Even though, specific 
antiserum was used in the assay, based on results from western blots (FIG. 11), significant cross 
reactivity between mosquito-borne -virus antigens and antisera specific for tick-borne viruses was 
found. Again, the antisera were not normalized prior to use in these studies. These ^results, in 
conjunction with those shown in FIG. 11, demonstrate that tlie use of mouse brain-derived viral 
antigen in a diagnostic assay does not provide the specificity- required to conclusively identify to 
agent responsible during flavivirus infection. 

The majority of the mouse "brain-derived viral antigens tested in these experiments were 
representative of the mosquito-borne flaviviruses. Unfortunately, the assay could not be 
performed using more TBE serocomplex antigens as some were not available from the World 
Arbovirus Reference Collection and others that were avaiLable in the collection could not be 
tested due to concerns about the complete inactivation of the virus during antigen preparation 
(i.e., live virus might be in the antigen preparations) and inadequate facilities for tested 
potentially infectious antigens {e.g. , BSL-4 for OHF and KFD antigens). 



50 



WO 2004/016586 



PCT/US2003/025681 



Domain III base d ELTSAs 

ELISAs using rDE as the antigen, ra_ther than mouse brain-derived viral antigen, 
demonstrated a much more specific reaction against homologous rDHI-specific antiserum. Both 
WN and YF rDIH reacted only with homologous serum (true for both YF wild-type Asibi strain 
and vaccine 17D strain. rDIH) (FIG. 14F-14H). The YF-Asibi rDffl rabbit antiserum cross- 
reacted with rDHI derived from YF vaccine strain 17D, an expected result as these envelope 
proteins are nearly identical (FIG. 14G). A stmilar result was seen in YF-17D rDIH coated 
plates (FIG. 14H). Recombinant DEI derived firom the TBE serocomplex of viruses, however, 
were not specific for individual vims rDHI specific rabbit antisera, but were cross-reactive with 
rr>in derived from viruses only within the TEtE serocomplex (FIG. 14A-14E, open symbols 
represent tick-borne flaviviruses). This result supports the western blot data presented in FIG. 1 1 
where cross-reactivity was seen between the rabbit antisera generated against the recombinant 
proteins of the TBE serocomplex. These assays found that TBE serocomplex derived rDIH 
cross-reacted with all of the TBE serocomplex specific rabbit anti-rDHI antisera, but not those 
derived from the mosquito-borne WN or YF viruses. This assay was also quite sensitive as 
serum diluted to 1:320 could easily be detected above a 0.2 OD450 cut-off for a positive test. 
The cross-reactivity among the TBE serocomplex viruses was somewhat expected as the level of 
amino acid identity among the envelope protein DEI from these viruses is very high (FIG. 16). 

To examine the ability of rDm to detect the presence of IgG in a model for analysis of 
test serum from a potentially infected individual, MIAF were assayed in plates coated with rDHI 
in experiments similar to those shown above using mouse brain-derived, viral antigen. In these 
experiments, it was fovind that the rDm coated plates were able to clearly differentiate MIAF 
derived from TBE serocomplex infected animals from those of mosquito-borne viruses (FIG. 
15). As seen in panels A-E of FIG. 15, TBE serocomplex rDHI cross-reacted with the majority 
of the TBE serocomplex MIAF tested. As with previous figures, TBE serocomplex specific 
MIAF are shown in open symbols. POW MLAF seemed to cross-react with all of the TBE rDin 
whereas the RSSE MtAF was somewhat less reactive. POW MIAF was also the only MIAF to 
react with OHF rDIII and with considerably less sensitivity than the other rDIH coated plates 
(FIG. 15E). Unfortunately, OHF specific MJLAF was not available from the World Arbovirus 
Reference Collection. Recombinant Dili for mosquito-bome flaviviruses was also highly 
specific as the WN MIAF reacted only with WIST rDHI, as was previously shown (FIG. 15F) and 
the YF-17D rD3 reacted with YF MIAF (FIG. 15G) though the sensitivity of this assay was not 
as high as with the TBE serocomplex rDIH or WN rDHI. Both of the YF rDIII cross-reacted 
with JE MIAF indicating potentially similar surface amino acid residues- 
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CLAIMS 

1 . A method of screening for a flavivirus in a subject o»r animal host comprising: 

a) contacting a sample from the subject or animal with a composition coroprising a 
flavivirus envelope protein domain IH polypeptide under conditions that permit 
formation of specific immunocomplex between an antibody in the samrple and the 
envelope protein domain HI polypeptide; an_d 

b) detecting whether a specific hnmunocomplex is formed. 

2. The method of claim 1 , wherein the envelope protein domain III polypeptide is a yellow 
fever vinxs envelope protein domain III polypeptide, West "Nile virus envelope protein domain III 
polypeptide, St. Louis encephalitis virus envelope protein domain 111 polypeptide, Murray Valley 
encephalitis virus envelope protein domain HI polypeptides, tick borne encephalitis s&iocomplex 
virus envelope protein domain m polypeptide or a combination thereof. 

3. The method of claim 1, wherein the envelope protein domain III polypeptide is not a 
dengue fever virus envelope domain m polypeptide. 

4. The method of claim 1, wherein the envelope protein domain III polypeptide is not a 
fusion protein. 

5 . The method of claim 1 , wherein the envelope prot ein domain III polypeptides is a West 
Nile virus envelope protein domain IT! polypeptide. 

6. The method of claim 1 , -wherein the envelope protein domain III polypeptide comprises 
an amino acid sequence that has at least an 80% identity ^svith SEQ ID NO: 1 1 . 

7. The method of claim 6, wherein the envelope pro"tein domain ITI polypeptide comprises 
an amino acid sequence that has at least an 85% identity -with SEQ ID NO:l 1 . 

8. The method of claim 1, wherein the envelope protein domain III polypeptide comprises 
an amino acid sequence that has at least an 90% identity with SEQ ID NO:l 1 . 
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9. The method of claim. 8, wherein the envelope protein domain III polypeptide comprises 
an amino acid sequence that has at least an 95% identity with SEQ ED NO:l 1 - 

10. . The method of claim 1 , wherein the envelope protein domain III polypeptide comprises 
amino acids 292 to 402 as set forth in SEQ ID NO:3 o»r an amino acid sequence set forth in SEQ 
ID NX): 8-21. 

1 1 . The method of claim 1 , wherein the envelope protein domain III polypeptide comprises 
an arriino acid sequence as set forth in SEQ ED NO:8-20. 

12. The method of claim 1, further comprising at least a second envelope protein domain III 
polypeptide. 

13. The method of claim 1, wherein the immuno&omplex is detected using anti-antibody 
secondary reagents. 

14. The method of claim 1 , wherein the immunocomplex is detected by ELIS A. 

1 5 . The method of claim 1 , wherein the immunocomplex is detected by ^Western blotting. 

16. The method of claim 1 , wherein the immunocomplex is detected by peptide array. 

17. The method of claim 1 , wherein the subject is a bird. 

18. The method of claim 1, wherein the antibody is an IgA, IgM or IgG antibody. 

1 9 . The method of claim 1 , wherein the envelope protein domain III polypeptide is obtained 
from a bacteria, a mammalian or an insect cell comprising an expression vector encoding the 
envelope protein domain III polypeptide. 

20. The method of claim 1, wherein the subject 5is infected with West Nile virus or a tick 
borne encephalitis serocomplex virus. 
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21. A composition, comprising an isolated West Nile virus or tick borne encephalitis 
serocomplex virus envelope protein domain III polypeptide. 

22. The composition of claim 21, wherein the West Nile virus envelope protein domain HI 
polypeptide is derived from West Nile strain 382-99, EthAn4766, 385-99, Kunjin MRM16, 
Golblum, TL443, DakAnMg798, or 804994 

23 . The composition of claim 21 , wherein the West Nile virus envelope protein domain m 
polypeptide comprises an amino acid sequence that is at least 80% identical to SEQ ED NO:ll. 

24. The composition of claim 23, wherein the West Nile virus envelope protein domain III 
polypeptide comprises an arnino acid sequence that is at least 85% identical to SEQ ID NO:l 1 . 

25. The composition of claim 24, wherein the West Nile virus envelope protein domain III 
polypeptide comprises an amino acid sequence that is at least 90% identical to SEQ ID NO: 11 . 

26. The composition of claim 25, wherein the West Nile virus envelope protein domain III 
polypeptide comprises an arnino acid sequence that is at least 95% identical to SEQ ID NO:ll. 

27. The composition of claim 26, wherein the West Nile envelope protein domain III 
polypeptide comprises amino acids 292 to 402 as set forth in SEQ ID 3NTO:3 or an amino acid 
sequence set forth hx SEQ ID NO:l 1 . 

28. The composition of claim 21, wherein, the envelope protein domain III polypeptide is 
operatively linked to a substrate. 

29. The composition of claim 28, wherein the substrate is a microtiter plate, a bead or a 
microarray. 

30. The composition of claim 21, wherein the composition is a vauccine composition. 

3 1 . The composition of claim 30, further comprising an adjuvant. 
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32. A kit for screening for flavivirus antibodies, in a suitable container means, comprising at 
least one envelope protein domain III polypeptide. 

33. The kit of claim 32, wherein the at least one envelope protein domain HI polypeptide is a 
yellow fever virus envelope protein domain HI polypeptide, West Nile virus envelope protein 
domain III polypeptide, St. Louis encephalitis virus envelope protein domain EH polypeptide, 
Murray Valley encephalitis virus envelope protein domain III polypeptide, a Central European 
encephalitis (CEE) virus envelope protein domain III polypeptide, a louping ill (LI) virus, a 
Russian spring-summer encephalitis (RSSE) virus enveLope protein domain III polypeptide, a 
Langat (LGT) virus envelope protein domain IE polypeptide, a Powassan virus (POW) envelope 
protein domain IE polypeptide, an Alkhurma (ALK) envelope protein domain EC p olypeptide, a 
Kyasanur Forest disease (KFD) -virus envelope protein domain III polypeptide, an Omsk 
hemorrhagic fever (OHF) virus envelope protein domain III polypeptide or a combination 
thereof. 

34. The kit of claim 32, wherein at least one envelope protein domain EI polyp eptide is a 
West ISfile virus envelope protein domain III polypepticLe. 

35. The kit of claim 32, wherein the envelope protein domain EI polypeptide comprises one 
or more of amino acids 292 to 402 as set forth in SEQ ED NO:3 or an amino acid s equence set 
forth in SEQ ID NO:8-20. 

36. The kit of claim 32, further comprising a non-reactive solid support to which the at least 
one envelope protein domain LII polypeptide is attached. 

37. The kit of claim 32, further comprising a first agent that detects an immunocomplex 
comprising the envelope protein domain EI polypeptides. 

38. The kit of claim 37, wherein the first and second agents are secondary antiljodies that 
specifically bind flavivirus antibodies. 

39. The kit of claim 37, wherein the first and second agents comprise a detectable label. 
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40. The kit of claim 39, wherein the detectable label is fluorescent, radioactive, colorimetric, 
or enzymatic. 

41 . A kit for screening for West Nile vims antibodies in a subject comprising: 

a) an assay plate comprising a multiplicity of microliter wells comprising a 
composition comprising at least one envelope protein domain 111 polypeptide 
capable of binding a flavivirus antibody in the sample that can specifically bind, to 
at least one envelope protein domain III polypeptide; and 

b) a container means comprising a labeled secondary antibody having specific 
binding affinity for a flavivirus antibody in the sample that can specifically bind 
to at least one envelope protein domain III polypeptide. 

42. A kit for screening for TBE seroconxplex antibodies in a subject comprising: 

a) an assay plate comprising a multiplicity of microtiter wells comprising a 
composition comprising at least one envelope protein domain ITI polypeptide 
capable of binding a flavivirus antibody in the sample that can specifically bind to 
at least one envelope protein domain III polypeptide; and 

b) a container means comprisirxg a labeled secondary antibody having specific 
binding affinity for a flavivirus antibody in the sample that can specifically bind 
to at least one envelope protein domain III polypeptide. 

43. A method of screening for flaviviius in a subject comprising: 

a) contacting a sample from the subject with a composition from the kit of claim 32; 
and, 

b) detecting whether an irnmuriocomplex is formed between an antibody and the= at 
least one envelope protein domain III polypeptide. 



59 



WO 2004/016586 



1/16 



PCT/US2003/025681 



DEH1 

DEN3 

D3SN2 

DEN4 

JE 

MV 

VTN 
SLR 



kgvsyvmc^gsfkiiekelvaetqhgtvlvqvkyegtdapcicapfssqdekgvt 
kgmsyamcl-intfvbkkevsetqhgtilikveykgedftpcicipfstedgqgka 
kgmsysmc^gkfkvveeiaetq40tivirvqyegdgspcicipleimdldnrh 
kgmsytmcs^gkfsidkemaetqhgttwkvkyegagapckrvpieirdwkek 
kgttygmc^yekfsfaknpadtghgtwielsysgsdgpcicipivsvasliildm 
kgttygmcijekftfsra^padtgiigtvvlelqytgsdgpcklipissvaslndm 
kgttygvcsjikafrflgtpadtgifgtwlelqytgtdgpcicipissvaslndij 

kgttygmclj|saftfsk3stptdtghgtvivelqytgsngpck.vpisvtanlmdl 
kgtsykmc^|dkmsfvk3siptdtghgtavmqvkvpkg-apcr.ipvmvaddltas 
kgltytmc^tkftwki^ptdsghStvvmevtfsgt-kpcripvravahgspd 
kgmtyotce^cf skfawkr-eptds gi0twmevtytgs -kpcr.i pvravahgbpn 

KGLTYTMCD^TKFTWKR-APTDSGI&TVVMEVTFSGT- kpcri PVRAVAHGSPD 
KGLTYTMCl|^SKFAWKRTPTDSGH^TVVMEVTFSGS-KPCR.IPVRAVAHGSPD 
KGLTYWCliMTKFTWKR_APTDSGltflTVVMEVGFSGT-EPCR.IPVRAVAHGVPE 
KGLTYTMCI|3|AKFTWKRJ^TDSGBDTVVMEVAFSGT-KPCE?.IPVRAVAHGSPD 
KGTTYSMCDfeVKFKWKR.VPVDSGHBTVVMEVSYTGSDKPCR.IPVRAVAHGVPA 



0 


IDEN1 
3DEN3 
DEN2 


1 


3DEN4 


I 


JE 




MV 

jam 




SLE 






M 


TBE 
KPD 


tl 


KDM 
IiX 




LGT 




OHF 
POW 



Q-N^RLITANPIVIDKEK--PW1E-"E PPFGET^ "'V^^GEKALKLSWI 
H -M -RLITANPWTKKEE - - PVNIEAE^ P#Gg|si|yiGI GDKALKINW 
/ L -P LITVWPIVTEKOS - - PVNVEAE-j PP|iGli&Yi'IIGV"^PGQLKLNVjSli 
V-v4RIISSTPI 1 AENTNS--VTniLjE-RP' J C "jTIVIGV GNSALTLHW^J 



TP^RLVTVNPFVATS SANS KVLVEME-f F PFi 3DSYXWGR.GDKQ INHHWBI 
TP^RMVTANPYVASSOTANAKV^ 

'GR GEQQ INHHWHf 

VPX GR ^VTVNPFVS VATANAKVL I L E J iT R_GE( IHH f 3 
TPV^RLVTWPFISTGGANNKVMIE ; 1 - PFGDS'i IWGR-GTTQINYHWlftE 
WK^ILVTVNPIASTN"ED--EVL E P-FGDSYII C ^DSRLTYQWlkfcE 

VNV^MLITPNPTIi5NN"GC - ^GFIEMQLPP-GDHIIYVG- - ELSYQWKQK 

OT ' SLITPNPSMENTGG ! 5LQLFF 3DI I " EI '.."(.r 

VNVjliMLITPNPTIENN-GG-I^GFIEMQLPP-Cr j 3- EL3HQUFQ) 
N" M t j TIE D< i I n rQT 

vTS! 3 LITPNPTt OTSTGG - 3FIEMQ1 11 GDNlfi 3 r .u TY i QK 

j r 'if 11, j r T r J • c i i i 

\ f CTPM riETlSTG e j - n -1 



FIG.1 



WO 2004/016586 



2/16 



PCT/US2003/025681 




WO 2004/016586 



3/16 



PCT/US2003/025681 




FIG. 3 



WO 2004/016586 



4/16 



PCT/US2003/025681 




WO 2004/016586 



5/16 



PCT/US2003/025681 









I* 


99 










IS 


anti-WN 




§ 




9v 


80% 1 
antiLGT 






? s 




1 


II 




1 £ 


Si 



5 




WO 2004/016586 



6/16 



PCT/US2003/025681 




s 33 (U (X \cn Q C 
J M co CO ED S f 



vo eg mlolo a> cnlooicol ty, w w 

crv r> in ir> ko a> r~ oo colin (-> ^ |r 

cn aj 05 cc as a O < r' Jl 

co tob g <* - 
co w m H k k * 



' i — W Wll 

11 # 



WO 2004/016586 PCT/US2003/025681 




WO 2004/016586 PCT/US2003/025681 




WO 2004/016586 



10/16 



PCT/US2003/025681 



HI 



VlVIosquito-boi 



- Omsk hemorhagic fever 



>Tick-borne 



FIG. 10 



WO 2004/016586 



11/16 



PCT/US2003/025681 



Recombinant D3 




FIG. 11 



WO 2004/016586 



12/16 



PCT/US2003/025681 




FIG. 12A-12F 



WO 2004/016586 



13/16 



PCT/US2003/025681 




FIG. 13A-13F 



WO 2004/016586 



14/16 



PCT/US2003/025681 




FIG. 14A-14H 



WO 2004/016586 



15/16 



PCT7US2003/025681 




FIG. 15A-15H 



WO 2004/016586 



16/16 



PCT/US2003/025681 





DEN1 


o 


DEN2 


f 


DEN3 




DEN 4 


1 


JE 


WN 




YF 




RSSE 




CEE 




LI 


H 


LGT 




POW 




KFD 




OHF 




DEN1 




DEN2 




DEN 3 




DEN 4 


Mo 


JE 
WN 




YF 




RSSE 




CEE 




LI 




LGT 


H 


POW 




KFD 




OHF 



300 351 
KGVSYVMCT-GSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDEKGVT 
KGMSYSMCY-GKFKWEEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLDNRH 
KGMSYAMCL-NTFVLKKEVSETQHGTILIKVEYKGEDAPCKIPFSTEDGQGK& 
KGMSYTMCS-GKFSIDKEMAETQHC5TTVVKVKYEGAGAPCKVPIEIRDVNKEK 
KGTTYGMCT— EKFSFAKNPADTGH.GTWIELSYSGSDGPCKIPI"VSVASLNDM 
KGTTYGVCS-KAFKFLGTPADTGHGTWLELQYTGTDGPCKVPISSVASLNDL 
KGTSYKMCT-DKMSFVKNPTDTGHGTAVMQVKVPKG-APCRIPVlylVADDLTAS 
KGLTYTMCDKTKFTWKRAPTDSGHlDTWMEVTFSGT-KPCRIPVjRAVAHGSPD 
KGLTYTMCDKTKFT WKRAPTDSGH DTWMEVTFS GT-KPCRI PVJRAVAGHS PD 
KGLTYTMCDKSKFAWKRTPTDSGHDTWMEVTFSGS-KPCRIPVEAVAHGSPD 
KGLTYTVCDKTKFTWKRAPTDSGHDTVVMEVGFSGT-RPCRIPVTRAVAHGVPE 
KGTTYSMCDKAKFKWKRVPVDSGHDTWMEVSYTGSDKPCRIPV-RAVAHGVPA 
KGMTYTVCEGSKFAWKRPPTDSGHDTWMEVTYTGS-KPCRIPV-RAVAHGEPN 
KGLTYTM.CDKAKFTWKRAPTDSGHDTWMEVAFSGT-KPCRIP\TRAVAHGSPD 

352 395 
Q-NGRLL TANPIVIDKEK — PVNI EAE-PPFGESYIWGAGEKA^LKLSWFKK 
V-LGRLZ TVNPIVTEKDS — PVNVEAE-PPLGDSYIIIGVEPGQLKLNWFKK 
H-NGRLI TANPWTKKEE— PVNI EAE-PPFGESNIVIGIGDK^LKINWYRK 
V-VGRII SSTPLAENTNS — VTNIELE-RPL— DSYIVIGVGNSA.LTLHWFRK 
T PVGRLVT VNPFVAT S SANSKVLVEME -PPFGDSYI WGRGDKQ INHHWHKA 
TPVGRLVTVNPFVSVATANAKVLXELE-PPFGDSYIWGRGEQQINHHWHKS, 
VNKGILVTVNPIASTNED— EVLX EVN- PPFGDSYI IVGTGDSRLTYQWHKE 

VNVAMLITPNPTIENNGG GFXEMQLPP-GDNIIYVG ELSYQWFQK 

VNVAMLITPNPTIENNGG GFXEMQLPP-GDNIIYVG ELSHQWFQK 

VNVAMLITPNPTIENDGG GFXEMQLPP-GDNIIYVG ELSHQWFQT 

VNVAMLITPNPTMENNGG GFXEMQLPP-GDNIIYVG DLNHQWFQK 

VNVAML I T PNPT IETNGG GFXEMQLPP-GDNIIYVG DLSQQWFQK 

VNVASLITPNPSMENTGG GFVELQLPP-GDNI IYVG ELSHQWFQK 

VDVAML X T PNPT IENNGG GFXEMQLPP-GDNIIYVG ELKHQWFQK 
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SEQUENCE LISTING 

<1X0> BARRETT, ALAN 
BEASLEY, DAVID 
HOLBROOK, MICHAEL 

<12 0> Compositions And Methods Related To :Flavivirus Envelope 
Protein Domain II X Antigens 

<13 0> UTFG:260WO 

<14 0> UNKNOWN 
<141> 2003-08-18 

<1S0> 60/445,581 
<151> 2003-02-06 

<150> 60/403,893 
<151> 2002-08-16 

<160> 25 

<170> Patentln Ver. 2.X 

<2 10> 1 
<211> 10962 
<212> DNA 

<2 13> West Nile virus 

<220> 

<2 21> CDS 

<222> (97) . . (10389) 

<400> 1 

agtagttcgc ctgtgtgagc tgacaaactt agtagtgt-fct gtgaggatta acaacaa-tta 60 

acacagtgcg agctgtttct tggcacgaag atctcg atg tct aag aaa cca gga 114 

MCet Ser Lys Lys Pro GL.y 
1 5 

ggg ccc ggt aaa aac egg get gtc aat atg eta aaa cgc ggt atg ccc 162 
Gly Pro Gly Lys Asn ftrg Ala Val Asn Met Lieu Lys Arg Gly Met Prro 
10 15 20 

cgc gga ttg tec ttg ata gga eta aag agg get atg ctg agt ctg afct 210 
A.nrg Gly Leu Ser Leu He Gly Leu Lys Arg Ala Met Leu Ser Leu He 
25 30 35 

gac ggg aag ggc cca ata cgt ttc gtg ttg get ctt ttg gcg ttt tfcc 25 8 
A^sp Gly Lys Gly Pro He Arg Phe Val Leu Ala Leu Leu Ala Phe Ptie 
40 45 50 

aga ttc act gca ate get ccg act cgt gcg gtg ctg gac aga tgg aga 306 
.Z^trg Phe Thr Ala lie Ala Pro Thr Arg Ala "Val Leu Asp Arg Trp Arrg 
55 60 65 -70 

ggc gtc aac aaa caa aca gca atg aag cat etc ttg agt ttc aag aaa 354 
Gly Val Asn Lys Gin Thr Ala Met Lys His lieu Leu Ser Phe Lys Iiys 
75 80 85 

gaa eta gga act ctg ace agt gec ate aac cgc egg age aca aaa caa 402 
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Glu Leu Gly Thr Leu Tlir Ser Ala He Asn Arg Arg Ser Thr Lys GLn 
90 95 100 

aa.g aaa aga gga ggc aca gcg ggc ttt act ate ttg ctt ggg ctg ate 
Lys Lys Arg Gly Gly Thr Ala Gly Phe Thr He Leu Leu Gly Leu He 
105 110 115 

gee tgt get gga get gtg acc etc teg aac ttc cag ggc aaa gtg atg 
Ala Cys Ala Gly Ala Val Thr Leu Ser Asn 3?he Gin Gly Lys Val Met 
120 125 130 

atg aca gtc aat gca acc gat gtc act gac gtg att acc att cca aca 
Met Thr Val Asn Ala Thr Asp Val Thr Asp "Val He Thr He Pro Tktr 
135 140 0.45 ISO 

get get ggg aaa aac ctg tgc ate gta agg get atg gac gta gga tac 
Ala Ala Gly Lys Asn Leu Cys He Val Arg Ala Met Asp Val Gly Tyr 
155 160 165 

att tgt gag gat act ate act tat gaa tgt ccg gtc eta get get gga 
L.eu Cys Glu Asp Thr He Thr Tyr Glu Cys Pro Val Leu Ala Ala Gly 
170 175 180 

aat gac cct gaa gac att gac tgc tgg tgc acg aaa tea tct gtt tac 
Asn Asp Pro Glu Asp Tie Asp Cys Trp Cys Thr Lys Ser Ser Val Tyr 
185 190 195 

gtg cgc tat gga aga tgc aca aaa act egg cat tec cgt cga age aga 
Val Arg Tyr Gly Arg Cys Thr Lys Thr Arg His Ser Arg Arg Ser Arg 
200 205 210 

agg tct ctg aca gtc cag aca cat gga gaa agt aca ctg gee aac a.ag 
Arg Ser Leu Thr Val Gin Thr His Gly Glu Ser Thr Leu Ala Asn Lys 
215 220 225 230 

aaa gga get tgg ttg gac age aca aaa gec acg aga tat ctg gtg =aag 
Lys Gly Ala Trp Leu Asp Ser Thr Lys Ala Thr Arg Tyr Leu Val Lys 

235 240 245 

aca gaa tea tgg ata etg aga aac ccg ggc tac gec etc gtt gca get 
Thr Glu Ser Trp lie Leu Arg Asn Pro Gly- Tyr Ala Leu Val Ala Z\1sl 
250 255 260 

gtc att gga tgg atg eta gga age aac aca atg caa cgc gtc gtg -ttt 
Val He Gly Trp Met Leu Gly Ser Asn Thr- Met Gin Arg Val Val 3?he 
265 270 275 

gec att eta ttg etc ctg gtg gca cca gca tac age ttc aac tgt tta 
Ala He Leu Leu Leu Leu Val Ala Pro Ala Tyr Ser Phe Asn Cys Leu 
280 285 290 

gga atg agt aac aga gac ttc ctg gag gga gtg tct gga get aca tgg 
Gly Met Ser Asn Arg Asp Phe Leu Glu Gly" Val Ser Gly Ala Thr Trp 
295 300 305 310 

gtt gat ctg gta ctg gaa ggc gat agt tgt gtg acc ata atg tea aaa 
Val Asp Leu Val Levi Glu Gly Asp Ser Cys Val Thr He Met Ser Lys 
31 5 3 2 0 325 

gac aag cca acc att gat gtc aaa atg atg aac atg gaa gca gec aac 
Asp Lys Pro Thr He Asp Val Lys Met Met Asn Met Glu Ala Ala Asn 
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330 335 340 

etc gca gat gtg cgc agt tac tgt tac eta get teg gtc agt gac ttg 1170 

Leu Ala Asp Val Arg Ser Tyr Cys Tyr Lesu Ala Ser Val Ser Asp> Leu 
345 350 355 

tea aca aga get gcg tgt cca ace atg ggt gaa gee cac aae gag aaa 1218 

Ser Thr Arg Ala Ala Cys Pro Thr Met Gly Glu Ala His Asn Glu. Lys 
360 365 370 

aga get gac ccc gec ttc gtt tgc aag caaa ggc gtt gtg gac aga. gga 12 66 

Arg Ala Asp Pro Ala. Phe Val Cys Lys Gin Gly Val Val Asp Arg Gly 

375 380 385 390 

tgg gga aat ggc tgc gga ctg ttt gga aag ggg age att gac aca tgt 1314 

Trp Gly Asn Gly Cys Gly Leu Phe Gly Irys Gly Ser lie Asp Thar Cys 
395 4 O0 405 

gcg aag ttt gec tgt aca ace aaa gca act gga tgg ate ate cag aag 13 62 

Ala Lys Phe Ala Cys Thr Thr Lys Ala TZhr Gly Trp lie He Glzn Lys 
410 415 420 

gaa aac ate aag tat gag gtt gec ata ttt gtg cat ggc ccg acg ace 1410 

Glu Asn He Lys Tyar Glu Val Ala lie E>he Val His Gly Pro Th.r Thr 
425 430 435 

gtt gaa tct cat ggc aag ata ggg gec acc cag get gga aga ttc agt 1458 

Val Glu Ser His Gly Lys He Gly Ala Thr Gin Ala Gly Arg Plxe Ser 
440 445 450 

ata act cca teg gcg cca tct tac acg c=ta aag ttg ggt gag tat ggt 1506 

He Thr Pro Ser Ala Pro Ser Tyr Thr Hieu Lys Leu Gly Glu Tyr Gly 

455 460 465 " 470 

gag gtt acg gtt gat tgt gag cca egg tea gga ata gac acc age gee 1554 

Glu Val Thr Val Asp Cys Glu Pro Arg Ser Gly He Asp Thr Ser Ala 
475 480 4S5 

tat tac gtt atg tea gtt ggt gag aag tec ttc ctg gtt cac cga gaa 1602 

Tyr Tyr Val Met Ser Val Gly Glu Lys Ser Phe Leu Val His Aarg Glu 
490 495 500 

tgg ttt atg gat ctg aac ctg cca tgg age agt get gga age acc acg 1650 

Trp Phe Met Asp Leu Asn Leu Pro Trp Ser Ser Ala Gly Ser Tlhr Thr 
505 510 515 

tgg agg aac egg gaa aca ctg atg gag ttt gaa gaa cct cat gec acc 1698 

Trp Arg Asn Arg Glu Thr Leu Met Glu Phe Glu Glu Pro His A_la Thr 
520 525 530 

aaa caa tct gtt gtg get eta ggg teg cag gaa ggt gcg ttg cac caa 1746 

Lys Gin Ser Val Val Ala Leu Gly Ser Gin Glu Gly Ala Leu HCis Gin 

535 540 545 " 550 

get ctg gec gga gcg att cct gtt gag ttc tea age aac act gtg aag 1794 

Ala Leu Ala Gly Ala He Pro Val Glu Phe Ser Ser Asn Thr Val Lys 
555 560 565 

ttg aca tea gga cat ctg aag tgt egg gtg aag atg gag aag ttg cag 1842 

Leu Thr Ser Gly His Leu Lys Cys Arg Val Lys Met Glu Lys r_.eu Gin 
570 575 580 
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ctg aag gga aca aca tat gga gta tgt tea aaa gcg ttc aaa. ttc get 1890 
Leu Lys Gly Thr Thr Tyr Gly Val Cys Ser Lys Ala Phe Lys Phe Ala 
585 590 595 

agg act ccc get gac act ggc cac gga acg gtg gtg ttg gaa ctg caa 193 8 
Arg Thr Pro Ala Asp Thr Gly His Gly Thr Val Val Leu Glu. Leu Gin 
600 605 610 

tat acc gga aca gac ggt ccc tgc aaa gtg ccc att tct tec gta get 1986 
Tyr Thr Gly Thr Asp Gly Pro Cys Lys Val Pro He Ser Sear Val Ala 
615 620 625 630 

tec ctg aat gac etc aca cct gtt gga aga ctg gtg acc gtg aat cca 2034 
Ser Leu Asn Asp Leu Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro 
635 640 645 

ttt gtg tct gtg gec aca gec aac teg aag gtt ttg att gaa etc gaa 2082 
Phe Val Ser Val Ala Thr Ala Asn Ser" Lys Val Leu He Glu. Leu Glu 
650 655 660 

ccc ccg ttt ggt gac tct tac ate gtg- gtg gga aga gga gaa cag cag 2130 
Pro Pro Phe Gly JA.sp Ser Tyr He Val Val Gly Arg Gly Glu Gin Gin 
665 670 675 

ata aac cat cac tgg cac aaa tct ggg- age age att gga aag gec ttt 217 8 
He Asn His His Trp His Lys Ser Gly Ser Ser He Gly Lys Ala Phe 
680 685 690 

acc acc aca etc aga gga get caa cga. etc gca get ctt gga gat act 2226 
Thr Thr Thr Leu Arg Gly Ala Gin Argr Leu Ala Ala Leu Gly Asp Thr 
695 700 705 710 

get tgg gat ttt gga tea gtt gga ggg gtt ttc acc tea gtg ggg aaa 2274 
Ala Trp Asp Phe Gly Ser Val Gly Gly Val Phe Thr Ser Va_l Gly Lys 
715 720 725 

gec ata cac caa gtc ttt gga gga get: ttt aga tea etc ttt gga ggg 2322 
Ala He His Gin Val Phe Gly Gly Ala. Phe Arg Ser Leu Plxe Gly Gly 
730 735 74=0 

atg tec tgg ate aca cag gga ctt ctg gga get ctt ctg ttg tgg atg 2370 
Met Ser Trp He Thr Gin Gly Leu Leu Gly Ala Leu Leu Leu Trp Met 
745 750 755 

gga ate aat gec cgt gac agg tea att get atg acg ttt ctt gcg gtt 2418 
Gly He Asn Ala Arg Asp Arg Ser He Ala Met Thr Phe Leu Ala Val 
760 765 770 

gga gga gtt ttg etc ttc ctt teg gtc aac gtc cat get gac aca ggc 2466 
Gly Gly Val Leu. Leu Phe Leu Ser VaU Asn Val His Ala Asp Thr Gly 
775 780 785 790 

tgt gee att gat att ggc agg caa ga.g etc egg tgc gga agt gga gtg 2514 
Cys Ala He Asp He Gly Arg Gin G3.-U Leu Arg Cys Gly Ser Gly Val 
795 800 805 

ttt ate cac aac gat gtg gaa gec tgg atg gat cgt tac aag ttc tac 2562 
Phe He His Asn. Asp Val Glu Ala Tx~p Met Asp Arg Tyr Lys Phe Tyr 
810 83_5 820 
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ccg gag acg cca cag ggc eta gca aaa att ate cag aaa gca cat gca 2610 
Pro Glu Thr Pro Gin Gly Leu Ala Lys lie He Gin Lys A2_a His Ala 
825 830 835 

gaa gga gtc tgc ggc ttg cgt tec gtt tec aga etc gag cac caa atg 2658 
Glu Gly Val Cys Gly Leu Arg Ser Va.1 Ser Arg Leu Glu His Gin Met 
840 845 850 

tgg gaa gee att aag gat gag ctg aac acc ctg ttg aaa gag aat gga 2706 
Trp Glu Ala IX e Lys Asp Glu Leu Asm Thr Leu Leu Lys GILu Asn Gly 
855 860 865 870 

gtc gac ttg agrt gtc gtg gtg gaa aaa cag aat ggg atg tac aaa gca 2754 
Val Asp Leu Ser Val Val Val Glu L,ys Gin Asn Gly Met Tyr Lys Ala 
875 880 885 

gca cca aaa cgt ttg get gec acc acc gaa aaa ctg gag atg ggt tgg 2802 
Ala Pro Lys Arg Leu Ala Ala Thr Trxar Glu Lys Leu Glu Met Gly Trp 
890 895 9 00 

aag get tgg ggc aag agt ate ate ttt gcg cca gaa eta get aac aac 2850 
Lys Ala Trp Gly Lys Ser lie He Ptie Ala Pro Glu Leu A_la Asn Asn 
905 910 915 

acc ttt gtc ate gac ggt cct gag act gag gaa tgc cca a.cg gec aac 2898 
Thr Phe Val I He Asp Gly Pro Glu Tiir Glu Glu Cys Pro T*hr Ala Asn 
920 925 930 

cga gca tgg aac agt atg gag gta gag gac ttt gga ttt gga ctg aca 2 946 
Arg Ala Trp Asn Ser Met Glu Val Glu Asp Phe Gly Phe Sly Leu Thr 
935 940 945 950 

age act cgc a-tg ttc ctg agg att egg gaa acg aac aca acg gaa tgc 2994 
Ser Thr Arg Met Phe Leu Arg He A^rg Glu Thr Asn Thr Thr Glu Cys 
955 960 965 

gac teg aag ate ata gga acc gec gtc aag aac aac atg get gtg cat 3042 
Asp Ser Lys lie He Gly Thr Ala Val Lys Asn Asn Met 2U.a Val His 
970 975 980 

agt gat eta tea tac tgg ata gag age gga etc aac gac acc tgg aag 3090 
Ser Asp Leu Ser Tyr Trp He Glu Ser Gly Leu Asn Asp Ohr Trp Lys 
985 990 995 

ctt gag agg gcg gtt eta gga gaa gtc aaa tea tgc acc tgg cca gaa 3138 
Leu Glu Arg Ala Val Leu Gly Glu Val Lys Ser Cys Thr "Trp Pro Glu 
1000 1005 1010 

acc cac act ctg tgg ggt gat gga gtt ctg gaa agt gat etc ate ata 3186 
Thr His Thr Leu Trp Gly Asp Gly Val Leu Glu Ser Asp leu He He 
1015 1020 1025 1030 

ccc ate acc ttg gca gga ccc aga age aac cac aac agg aga cca ggg 3234 
Pro lis Thr Leu Ala Gly Pro Arg Ser Asn His Asn Arg Arg Pro Gly 
1035 1040 1045 

tac aaa act crag aac caa ggc cca tgg gat gag ggg cgc gtc gag att 3282 
Tyr Lys Thr csln Asn Gin Gly Pro Trp Asp Glu Gly Arg Val Glu He 
1O50 X055 3_060 



gac ttt gac tat tgc cca gga aca aca gta act ata agt gac agt tgc 
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Asp Phe Asp Tyr Cys Pro Gly Tnr Thr Val Thr He Ser Asp Ser Cys 
10S5 1070 1075 

gaa cac cgt gga cct gcg gca cgc aca acc act gag agt ggg aag etc 

Glu His Arg Gly Pro Ala Ala Arg Thr Thr Thr Glu Ser Gly Lys Leu 
1080 1085 1090 



ate aca gac tgg tgc tgc aga agt fcgc acc etc cct cca 
He Thr Asp Trp Cys Cys Arg Ser Cys Thr Leu Pro Pro 
1095 1100 1105 



ctg cgc ttc 
Leu Arg Phe 
1110 



cag act gag aat ggc tgt tgg tat gga atg gaa att cga cct acg egg 
Gin Thr Glu Asn Gly Cys Trp Tyr Gly Met Glu He Arg Pro Thr Arg 
1115 1120 1125 

cac gac gaa aag acc etc gtg caa "teg aga gtg aat gca. tac aac gec 
His Asp Glu Lys Thr Leu Val Gin Ser Arg Val Asn Ala. Tyr Asn Ala 
0.13 0 1135 1140 

gac atg att gat cct ttt cag ttg ggc ctt atg gtc gtgj ttc ttg gee 
Asp Met He Asp Pro Phe Gin Leu Gly Leu Met Val Val. Phe Leu Ala 



acc cag gagr gtc ctt cgc aag agg tgg acg gee aag ate age att cca 
Thr Gin Glu Val Leu Arg Lys Arg Trp Thr Ala Lys Ilea Ser He Pro 
1160 11S5 1170 



get ate atgj ctt gca etc eta gtc 
Ala He Met Leu Ala Leu Leu Val 
1175 1180 

act gat gtc ctg cga tat gtc att 
Thr Asp Val Leu Arg Tyr Val lie 
1195 



eta gtg ttt ggg ggfc att acg tac 
Leu Val Phe Gly Gly He Thr Tyr 
1185 1190 

etc gtc ggc gee gcg ttt get gaa 
Leu Val Gly Ala Ala Phe Ala Glu 
1200 1205 



gca aac tea gga gga gac gte gtg cac ttg gca ctt atg get aca ttc 

Ala Asn Sear Gly Gly Asp Val Val His Leu Ala Leu Met Ala Thr Phe 

1210 3.215 1220 

aag att caa cca gtc ttt ctg gtg get tec ttt ttg aag gca agg tgg 

Lys He Gin Pro Val Phe Leu Val Ala Ser Phe Leu Lys Ala Arg Trp 

12 2 5 1230 123 S 

acc aac caa gag agt att ttg etc atg ctt gca get gc t ttc ttc caa 

Thr Asn Gin Glu Ser He Leu Leu Met Leu Ala Ala Al a Phe Phe Gin 

1240 1245 1250 

atg get tac tat gac gee aag aat gtt ctg tea tgg ga_a gtg cct gac 

Met Ala Ty;r Tyr Asp Ala Lys Asn. Val Leu Ser Trp Gl_u Val Pro Asp 

1255 1260 1265 1270 

gtt ttg aac tct etc tec gtt gcg- tgg atg att etc aga get ata age 

Val Leu Asn Ser Leu Ser Val Ala. Trp Met He Leu Arrg Ala He Ser 

1275 1280 1285 

ttc acc aac act tea aat gtg gtg- gtg ccg ctg ctg gac ctt ttg aca 

Phe Thr Assn Thr Ser Asn Val Val Val Pro Leu Leu A3- a Leu Leu Thr 



cct gga ttg aaa tgc tta aac ctt: gat gtg tac aga att ttg eta etc 
Pro Gly Leu Lys Cys Leu Asn Leu. Asp Val Tyr Arg He Leu Leu Leu 
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1305 X310 1315 

atg gtt gg a gtt gga age etc ate aaa gaa aaa ags age tct gca gca 4»98 
Met Val Gly Val Gly Ser Leu lie Lys Glu Lys Arg Ser Ser Ala Ala 
1320 1325 133 O 

aaa aag- aaa gga get tgc etc ate tgc eta gcg ctg gcg tct aca gga 4 7L.46 
Lys Lys Lys Gly Ala Cys Leu lie Cys Leu Ala Le-u Ala Ser Thr Gly 
1335 1340 1345 1350 

gtg ttc aat cca atg ata ett gca get ggg eta atg get tgc gac ccc 4 3.94 
Val Phe Asn Pro Met He Leu Ala Ala Gly Leu Mat Ala Cys Asp Pro 
1355 1350 1365 

aac ogc aag egg ggc tgg cct get aca gaa gtg atg act gca gtt gga 4 242 
Asn Arg Lys Arg Gly Trp Pro Ala Thr Glu Val Meat Thr Ala Val Gly 
1370 1375 1380 

etc atg ttt gee ate gtt ggg ggt ctg gca gaa cfct gac ata gat tct 412 90 
Leu Met Phe Ala He Val Gly Gly Leu Ala Glu Leu Asp He Asp Ser 
1385 1390 1395 

atg get ate ccc atg acc ate gee gga ett atg tfcrc gcg gca ttt gtc 4 338 
Met Ala He Pro Met Thr He Ala Gly Leu Met Pfcie Ala Ala Phe Val 
140 O 1405 14 3.0 

ate tct gga aag tea aca gac atg tgg att gag agg acg get gac att 4=386 
He Sear Gly Lys Ser Thr Asp Met Trp He Glu Arrg Thr Ala Asp He 
1415 1420 1425 1430 

act tg-g gag agt gat get gaa ate aca ggc tct age gaa aga gta gat <i=434 
Thr Trp Glu Ser Asp Ala Glu. He Thr Gly Ser S er Glu Arg Val Asp 
1435 1440 1445 

gtg agg ctg gat gat gat gga aat ttt caa ctg a_tg aat gac ccc ggg 4482 
Val Arrg Leu Asp Asp Asp Gly Asn Phe Gin Leu MCet Asn Asp Pro Gly 
1450 1455 1460 

gca caa tgg aaa att tgg atg ett agg atg gec tgc ctg gcg ata agt 4530 
Ala Pro Trp Lys He Trp Met Leu Arg Met Ala Cys Leu Ala He Ser 
1465 1470 1475 

gec tac aca cct tgg gca att etc ccc teg gtc ate gga ttc tgg ata -4578 
Ala Tyr Thr Pro Trp Ala He Leu Pro Ser Val Xle Gly Phe Trp He 
1480 1485 1390 

acc ett cag tac aca aag aga gga ggt gtt ett tgg gac aca cca tea -4626 
Thr Leu Gin Tyr Thr Lys Arrg Gly Gly Val Leu Irp Asp Thr Pro Ser 
1495 1500 1505 1510 

ccc aag gag tac aag aag ggt gat acc acc act ggc gtt tac aga ate 4 674 
Pro L.ys Glu Tyr Lys Lys Gly Asp Thr Thr Thr Gly Val Tyr Arg He 
1515 1520 1525 

atg act cga ggt ctg ett ggc agt tac caa get gga gec gga gtg atg 4722 
Met Thr Arg Gly Leu Leu GILy Ser Tyr Gin Ala Gly Ala Gly Val Met 
1530 1535 1540 

gta gag ggg gtg ttc cac aca eta tgg cac acc act aag gga get get 4770 
Val Glu Gly Val Phe His Thr Leu Trp His Thr Thr Lys Gly Ala Ala 
1545 1550 1555 
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etc atg agt ggt gag gga cgt ctg gat ccc tac tgg ggg age gtg aaa 4818 
Leu Met Ser Gly Glu Gly Arg Leu Asp Pro Tyr Trp Gly Ser Val LyS 
1560 1-565 3.570 

gag gac cga ctt tgc tat ggg ggg cca tgg aaa etc caa cat aaa tgg 4866 
Glu Asp Arg Leu Cys Tyr Gly Gly Pro Trp Lys Leu Gin His Lys Trp 
1575 1580 1585 1590- 

aat gga cat gat gag gtc caa atg att gtc gtg gag cca ggg aaa aat 4 914 
Asn Gly His Asp Glu Val Gin Met lie Val Val Glu Pro Gly Lys Asm 
1595 1600 1605 

gtg aaa aac gtc cag acc aag ccc gga gtg ttt aag aca cca gaa gga 4962 
Val X.ys Asn Val Gin Thr Lys Pro Gly Val Phe Lys Thr Pro Glu Gly 
1610 1615 1620 

gaa att ggg gca gtt acg eta gac tat cct acc gga acg tea ggt tec 5 010 
Glu He Gly Ala Val Thr Leu Asp Tyr Pro Thr* Gly Thr Ser Gly Ser 
1625 1630 1635 

ccc att gta gac aaa aat gga gat gtg att gga. ttg tat ggg aac ggc 5058 
Pro He Val Asp Lys Asn Gly Asp Val lie Gly- Leu Tyr Gly Asn Gly 
0.640 1645 1650 

gtc ate atg cct aat ggt tea tac ata age gec: att gtg caa gga gag 510 6 
Val lie Met Pro Asn Gly Ser Tyr He Ser Ala. He Val Gin Gly Glu 
1655 1660 1665 1670 

aga atg gaa gaa ccg gca cca get ggc ttc gaa. cct gaa atg ttg agg 5154 
Arg Met Glu Glu Pro Ala Pro Ala Gly Phe Gltn. Pro Glu Met Leu Arg 
1675 1680 1685 

aag aaa cag ate act gtc ctt gat ctg cac cccr: gga gca gga aag aca 52 02 
Lys Lys Gin He Thr Val. Leu Asp Leu His Pro Gly Ala Gly Lys Thr 
1690 1695 1700 

cgc aag ata ctt ccc caa ate ate aag gag gec ate aac aaa aga ttg 52 5 0 
Arg Lys He Leu Pro Gin He He Lys Glu Ala. He Asn Lys Arg Leu 
1705 1710 1715 

agg acg get gtg ctg gca. ccc acc agg gtc gtt; get get gag atg tct 5298 
Arg Thr Ala Val Leu Ala. Pro Thr Arg Val Vail Ala Ala Glu Met Ser- 
3.720 1725 1730 

gag gee ctg aga gga ctt ccc att egg tac caa acc tea gca gtg cac 5346 
Glu. Ala Leu Arg Gly Leu. Pro He Arg Tyr Glzn Thr Ser Ala Val His 
1735 1740 1745 1750 

aga gag cac agt gga aat gag ate gtt gat gt <c atg tgc cat gee acc 5394 
Arg Glu His Ser Gly Asn Glu He Val Asp Va 1 Met Cys His Ala Thrr 
1755 1760 1765 

etc aca cac agg ctg atg tct cca cac aga gt c ccc aac tac aac ctg 5442 
Leu. Thr His Arg Leu Met Ser Pro His Arg Val Pro Asn Tyr Asn Leux 
1770 1775 1780 

ttc ata atg gat gaa gec cat ttc acg gat cca gcg age ate gca gec 5490 
Phe He Met Asp Glu Ala His Phe Thr Asp Pr~o Ala Ser He Ala Ala. 
1785 1790 1795 
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aga gga tac at a gca acc aag gtt gaa ttg ggc gaa gcc gcc gcg att 

Arg Gly Tyr He Ala Thr Lys Val Glu Leu Gly Glu Ala Ala Ala 3le 
1800 1805 1810 

ttc atg acg gca acg cca ccc ggg act tct gac ccc ttt cca gag -fcct 

Phe Met Thr Ala Thr Pro Pro Gly Thr Ser Asp Pro Phe Pro Glu Ser 
1815 ]_820 0_825 1830 

aat get cct ate teg gac atg caa aca gag ate cca gac aga gcc -tgg 

Asn Ala Pro He Ser Asp Met Gin Thr Glu He Pro Asp Arg Ala Trp 
1835 1840 1845 

aac act gga tat gaa tgg ata act gag tat gtt gga aag acc gtt tgg 

Asn Thr Gly Tyr Glu Trp He Thr Glu Tyr Val Gly Lys Thr Val Trp 
1850 1855 18S0 

ttt gtt cca agt gtg aaa atg gga aat gag att gcc etc tgt ctg caa 

Phe Val Pro Ser Val Lys Met Gly Asn Glu He Ala Leu Cys Leu Gin 
1865 1870 1875 

egg gcg ggg aag aag gtt ate cag ctg aac aga aag tec tat gag aca 

Arg Ala Gly Lys Lys Val He Gin Leu Asn Arg Lys Ser Tyr Glu Thr 
1880 1885 1890 

gag tac ccc aag tgt aag aac gat gat tgg gat ttt gtc ate acc aca 

Glu Tyr Pro Lys Cys Lys Asn Asp Asp Trp Asp Phe Val He Thr Thr 
1895 3.900 ZL905 3.910 

gac ata tea gaa atg gga gcc aac ttc aag gcg age aga gtg ate gac 

Asp He Ser Glu Met Gly Ala Asn Phe Lys Ala Ser Arg Val lie Asp 
1915 192 0 1925 

age cgc aaa age gtg aaa ccc acc ate att gag gaa ggt gat gga aga 

Ser Arg Lys Ser Val Lys Pro Thr He lie Glu Glu Gly Asp Gly Arg 
1930 1935 1940 

gtc ate ctg ggg gaa. ccc tea gcc ate acgr get gcc age get get cag 

Val He Leu Gly Glu. Pro Ser Ala He Thr- Ala Ala Ser Ala Ala Gin 
1945 1950 1955 

egg aga gga cgc ata. gga aga aac cca tea. caa gtt ggt gat gag tat 

Arg Arg Gly Arg Ilei Gly Arg Asn Pro Ser~ Gin Val Gly Asp Glu Tyr 
1960 1965 1970 

tgc tat gga ggg cac aca aat gag gat gat tec aac ttt get cac tgg 

Cys Tyr Gly Gly His Thr Asn Glu Asp Asp? Ser Asn Phe Ala His Trp 
1975 1980 1985 3.990 

aca gag get cgc ate atg eta gac aac ate: aac atg ccg aat ggt ctg 

Thr Glu Ala Arg I1& Met Leu Asp Asn He Asn Met Pro Asn Gly Leu 
1993 2000 2005 

gtg get caa eta tat cag cct gag cgc gag aag gtg tac acc atg gac 

Val Ala Gin Leu Tyr Gin Pro Glu Arg Glu Lys Val Tyr Thr Met Asp 
2010 2015 2020 

ggg gaa tac agg etc aga ggg gaa gaa egg aag aac ttc ctt gaa. ttc 

Gly Glu Tyr Arg Leia Arg Gly Glu Glu Arg Lys Asn Phe Leu Glu. Phe 
2025 2030 2035 

ctg aga aca get gafc tta cca gtc tgg etc get tac aaa gtg gca. gca 
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Leu Arg Thr Ala Asp Leu Pro Val Trp Leu Ala Tyr Lys Val AZLa Ala 
2040 2045 2050 

gca gga ata tea fcac cat gac egg aaa tgg tgc ttt gat gga c ct cga 630S 
Ala Gly lie Ser Tyr His Asp Arg Lys Tarp Cys Phe Asp Gly Pro Arg 
2055 2060 2065 2070 

ace aac acg att ctt gaa gac aac aat ga_a. gtt gaa gtc ate acg aag 6354 
Thr Asn Thr He Leu Glu Asp Asn Asm Gin Val Glu Val He Thr Lys 
2 075 208 O 20 85 

tfc 9 ggt gag aga aag ate eta aga ccc ag-g tgg gca gat get aga gtg 64 02 
Leu Gly Glu Arg Lys He Leu Arg Pro Arrg Trp Ala Asp Ala 7Xrg Val 
2090 2095 2100 

tac tea gac cat caa get eta aag tec ttc aaa gat ttt gca teg ggg 645 0 
Tyr Ser Asp His Gin Ala Leu Lys Ser Plxe Lys Asp Phe Ala Ser Gly 
2105 2110 2115 

aaa cga tea caa ate ggg etc gtt gag gtg etc ggg aga atg cct gaa 6498 
Lys Arg Ser Gin He Gly Leu Val Glu Val Leu Gly Arg Met E?ro Glu 
2120 2125 2130 

cac ttc atg gtg aaa act tgg gag gca ttg gac acg atg tat gtg gtg 6546 
His Phe Met Val Lys Thr Trp Glu Ala Leu Asp Thr Met Tyr "Val Val 
2135 2140 2145 2150 

gcg acc get gaa aaa gga ggc cga get cac agg atg get ctt gag gag 6594 
Ala Thr Ala Glu Lys Gly Gly Arg Ala His Arg Met Ala Leu Glu Glu 
2155 2160 2 165 

eta ccg gac gec ctt cag aca ata gtt ttg att gca eta ttg agt gtg 6642 
Leu Pro Asp Ala Leu Gin Thr He Val lieu He Ala Leu Leu Ser Val 
2170 2175 2180 

atg tee tta ggt gtg ttt ttt eta etc a_tg caa agg aag ggc att ggt 6690 
Met Ser Leu Gly Val Phe Phe Leu Leu MCet Gin Arg Lys Gly He Gly 
2185 2190 2195 

aag att ggc ttg gga gga gta ate tta grga get gee aca ttc ttc tgc 673 8 
Lys He Gly Leu Gly Gly Val He Leu Gly Ala Ala Thr Phe Phe Cys 
2200 2205 2210 

tgg atg get gaa gtc cca gga acg aaa ata gca ggc atg etc ctg ctt 6786 
Trp Met Ala Glu. Val Pro Gly Thr Lys He Ala Gly Met Leu Leu Leu 
2215 2220 2225 2230 

tec ctg ctg etc: atg att gtt ttg att crcg gag ccg gaa aag cag cgc 6834 
Ser Leu Leu Leix Met He Val Leu He Pro Glu Pro Glu Lys Gin Arg 
2235 2240 2245 

tea cag act gat aac cag etc gec gtg ttc ttg ate tgt gtg etc aca 6882 
Ser Gin Thr Asgj Asn Gin Leu Ala Val 3?he Leu He Cys Val Leu Thr 
2250 2255 2260 

ctg gtc ggc gec gtg get gec aat gaa atg ggc tgg ctg gac aag acc 693 0 
Leu Val Gly Ala Val Ala Ala Asn Glu Met Gly Trp Leu Asp Lys Thr 
2265 2270 2275 

aag aat gac att ggc age ctg ttg ggg cac agg cca gaa get aga gag 6978 
Lys Asn Asp He Gly Ser Leu Leu Gly His Arg Pro Glu Ala. Arg Glu 
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acg acc ctg gga gtt gag age ttc tta ctt gat ctg egg ccg gec acg 
Thr Thr Leu Gly Val Glu Ser Phe Leu Leu Asp Leu Arg pro Ala Thr 
2295 2300 2305 2310 

gca tgg teg etc tat gec gta acg aca gec gtt etc acc cct ttg ctg 
Ala Trp Ser I_.eu Tyr Ala Val Thr Thr Ala Val Leu Thr Pro Leu Leu 
2315 2320 2325 

aag cat eta ate acg tea gac tac ate aac act teg ttg acc tea ata 
Lys His Leu He Thr Ser Asp Tyr He Asn Thr Ser Leu Thr Ser He 
2330 2335 2340 

aac gtc caa gec age gcg ttg ttc act ttg gec aga ggc ttc cct ttt 
Asn Val Gin .A.la Ser Ala Leu Phe Thr Leu Ala Arg Gly Phe Pro Phe 
2345 2350 2355 

gtg gac gtt ggt gtg tea get etc ttg ctg gcg gtc ggg tgc tgg ggt 
Val Asp Val Gly Val Ser Ala Leu Leu Leu Ala Val Gly Cys Trp Gly 
2360 2365 2370 

cag gtg act ctg act gtg act gtg act gca get get ctg etc ttt tgc 
Gin Val Thr Leu Thr Val Thr Val Thr Ala Ala Ala Leu leu Phe Cys 
2375 2380 2385 2390 

cac tat get -tac atg gtg cca ggc tgg caa gcg gaa gee atg cga tct 
His Tyr Ala Tyr Met Val Pro Gly Trp Gin Ala Glu Ala Met Arg Ser 
2395 2400 2405 

gee cag egg egg aca get get ggc ate atg aaa aat gta gtg gtg gat 
Ala Gin Arg Arg Thr Ala Ala Gly He Met Lys Asn Val Val Val Asp 
2 410 2415 2 420 

ggg ate gtg gec act gat gta cct gaa ctt gaa cga aca act cca gtc 
Gly He Val Ala Thr Asp Val Pro Glu Leu Glu Arg Thr Thr Pro Val 
2425 2430 2435 

atg cag aaa aaa gtt gga cag ate ata ttg ate ttg gta tea atg gec 
Met Gin Lys Lys Val Gly Gin He He Leu He Leu Val Ser Met Ala 
2440 2445 2450 

gcg gtg gtc gtc aat cca tea gtg aga acc gtc aga gag gec gga att 
Ala Val Val Val Asn Pro Ser Val Arg Thr Val Arg Glu Ala Gly He 
2455 2460 2465 2470 

ctg act aca gca gca gca gtc acc eta tgg gag aat ggt get agt tea 
Leu Thr Thr Ala Ala Ala Val Thr Leu Trp Glu Asn Gly Ala Ser Ser 
2475 2480 ~ 2485 



gtg tgg aat gca acg aca get att ggc ctt tgt cac ate 
Val Trp Asn Ala Thr Thr Ala He Gly Leu Cys His He 
2490 2495 



atg cga gga 
Met Arg Gly 
2500 



gga tgg etc teg tgt etc tec ate atg tgg act etc ate aaa aac atg 

Gly Trp Leu. Ser Cys Leu Ser He Met Trp Thr Leu lie Lys Asn Met 

2505 2510 2515 

gag aaa cca ggc etc aag agg ggt gga gee aaa gga cgc acg eta ggg 

Glu Lys Pro Gly Leu Lys Arg Gly Gly Ala Lys Gly Arg Thr Leu Gly 

2520 2525 2530 
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gaa gtt tgg aag gag aga etc aac cac atg acg aag gaa gaa ttt acc 

Glu Val Tip Lys Glu Arg Leu Asn His Met Thr Lys Glu Glu Phe Thr 

2535 2540 2545 2550 

aga tac aga aaa gaa gec ate act gaa gtt gac cg<s tec gca gca aaa 

Arg Tyr Arg Lys Glu Ala lie Thr Glu Val Asp Arg Ser Ala Ala Lys 

2555 2560 2565 

cat get agg aga gag gga aac ate act gga ggc cac cca gtc tea egg 

His Ala Arg Arg Glu Gly Asn He Thr Gly Gly His Pro Val Ser Arg 

2570 2575 2580 

gga acc gcg aaa tta egg tgg tta gtg gaa agg cg~t ttc etc gag cca 

Gly Thr Ala Lys Leu Arg Trp Leu Val Glu Arg Arg phe Leu Glu Pro 

2585 2 590 2595 

gtg gga aag gtt gtg gat etc ggg tgt ggt aga gg c ggc tgg tgc tat 

Val Gly Lys Val Val Asp Leu Gly Cys Gly Arg Gl-y Gly Trp Cys Tyr 

2600 2605 26L 0 

tac atg get acc cag aag agg gta cag gaa gtg aa_a ggg tac acg aaa 

Tyr Met Ala Thr Gin Lys Arg Val Gin Glu Val Ly~s Gly Tyr Thr Lys 

2615 2620 2625 2630 

gga gga cot ggc cat gaa gaa cca caa ctg gtg ca_g age tat ggt tgg 

Gly Gly Fro Gly His Glu Glu Pro Gin Leu Val GL.n Ser Tyr Gly Trp 

2635 2640 2645 

aat att gtt acc atg aag agt gga gtc gac gtc ttc tac aga cca tea 

Asn He Val Thr Met Lys Ser Gly Val Asp Val Plate Tyr Arg Pro Ser 

2650 2655 2660 

gaa gcg age gac aca ctg etc tgt gac att gga gatg tea teg tea agt 

Glu Ala Ser Asp Thr Leu Leu Cys Asp He Gly Gl_u Ser Ser Ser Ser 

2 665 2670 2675 

gee gag gta gaa gaa cac cgc acc gtc cgt gtc ctrg gag atg gtg gaa 

Ala Glu Val Glu Glu His Arg Thr Val Arg Val Leu Glu Met Val Glu 

2680 2685 2690 

gat tgg ttg cac aga gga ccg aag gaa ttc tgc ate aaa gtg eta tgc 

Asp Trp Leu His Arg Gly Pro Lys Glu Phe Cys He Lys Val Leu Cys 

2695 2700 2705 2710 

cct tac atg ccc aaa gtg att gag aag atg gaa aca etc caa agg cga 

Pro Tyr Met Pro Lys Val He Glu Lys Met Glu Tfcir Leu Gin Arg Arg 

2715 2720 2725 

tat gga ggt ggc ctt ata aga aac ccc ctt tea cgc aac tct acc cat 

Tyr Gly Gly Gly Leu He Arg Asn Pro Leu Ser Astrg Asn Ser Thr His 

2730 2735 2740 

gag atg tac tgg gtg age cac get tea ggc aat ate gtc cac tec gtc 

Glu Met Tyr Trp Val Ser His Ala Ser Gly Asn I He Val His Ser Val 

2745 2750 2755 

aac atg aca age cag gtg ctt ctg ggg agg atg gaa aag aaa aca tgg 

Asn Met Thr Ser Gin Val Leu Leu Gly Arg Met GZLu Lys Lys Thr Trp 

2760 2765 27"70 
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aa 9 9g=>- cc c ca 9 tt:t 9 a 9 gaa gat gtc aac ttg gga agt gga acg egg 
Lys Gl-y Pro Gin Phe Glu Gin Asp Val Asn Leu C31y Ser Gly Thr Arg 
2775 2780 2785 2790 

gca gta ggg aag cct etc etc aat tct gat act age aag ate aag aac 
Ala Val Gly Lys Pro Leu Leu Asn Ser Asp Thr Ser Lys He Lys Asn 
2795 2800 2805 

cga at"t gag agg ctg aag aaa gaa tac age tec aca tgg cac cag gat 
Arg He Glu Arg Leu Lys Lys Glu Tyr Ser Ser Thr Trp His Gin Asp 
2810 2815 2820 

gcg aac cac ccc tae agg ace tgg aac tac cac gga age tat gaa gtg 
Ala Asn. His Pro Tyr Arg Thir Trp Asn Tyr His Gly Ser Tyr Glu Val 
2825 2830 2835 

aaa cca acc ggc tea gee age tec ctt gtg aat ggg gta gtc aga tta 
Lys Piro Thr Gly Ser Ala Ser Ser Leu Val Asn Gly Val Val Arg Leu 
284:0 2845 2:850 

etc tea aaa cca tgg gac act ate acc aat gtg acc acg atg gec atg 
Leu Setr Lys Pro Trp Asp Ttir He Thr Asn Val Thr Thr Met Ala Met 
2855 2860 2865 2870 

aca gac acc act cct ttc ggt caa caa cga gtg ttc aag gaa aag gtg 
Thr Asp Thr Thr Pro Phe Gly Gin Gin Arg Val Phe Lys Glu Lys Val 
2875 2880 2885 

gac aca aag get cca gag cct cca gaa gga gtc aaa tac gtc etc aat 
Asp Ttxr Lys Ala Pro Glu Paro Pro Glu Gly Val Lys Tyr Val Leu Asn 
2890 2895 2900 

gag acc acg aac tgg ctg tgg get ttt tta gec cgc gat aag aaa ccc 
Glu Ttir Thr Asn Trp Leu Tarp Ala Phe Leu Ala Arg Asp Lys Lys Pro 
2905 2910 2915 

agg atg tgt tec egg gag gaa ttt att gga aaa gtc aac agt aat gec 
Arg Met Cys Ser Arg Glu Slu Phe He Gly Lys Val Asn Ser Asn Ala 
2920 2925 2930 

gec eta gga gcg atg ttt gaa gaa cag aac caa tgg aag aac gec egg 
Ala Leu Gly Ala Met Phe Glu Glu Gin Asn Gin Trp Lys Asn Ala Arg 
2935 2940 2945 2950 

gaa get gta gag gat cca aag ttt tgg gag atg gtg gat gag gag cgt 
Glu JVla Val Glu Asp Pro Hys Phe Trp Glu Met Val Asp Glu Glu Arg 
2955 2960 2965 

gaa gcg cat etc cgt gga cjaa tgc aac acc tgc ate tac aac atg atg 
Glu A.la His Leu Arg Gly Glu Cys Asn Thr Cys He Tyr Asn Met Met 
2970 2975 2980 

gga aag aga gag aag aag cct gga gag ttc ggc= aaa get aaa ggc age 
Gly Lys Arg Glu Lys Lys Pro Gly Glu Phe Gly Lys Ala Lys Gly Sear 
2985 2990 2995 

aga gec ate tgg ttc atg tgg ctg ggg gec cgc= ttc ctg gag ttt gaa 
Arg Ala He Trp Phe Met Trp Leu Gly Ala Arg Phe Leu Glu Phe Glix 
3O00 3O05 3010 

get etc gga ttc etc aat ^aa gac cac tgg ctg ggt agg aag aac tea 
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Ala Leu Gly Phe Leu AS^a Glu Asp His Trp Lea Gly Arg Lys Asn Ser 
3015 302 0 3025 3030 

gga gga gga gtt gaa gg-c tta gga ctg cag aag: etc ggg tac ate ttg 9234 
Gly Gly Gly Val Glu Gly Leu Gly Leu Gin Lyst Leu Gly Tyr He Leu 
3035 3040 3045 

aag gaa gtt gga aca aag cct gga gga aag gtt tac get gat gat aca 92 82 
Lys Glu Val Gly Thr Lys Pro Gly Gly Lys Val_ Tyr Ala Asp Asp Tfcir 
3050 3055 3060 

gca ggc tgg gac aca cgc ate acc aaa get gac etc gag aat gaa gcg 933 0 
Ala Gly Trp Asp Thr Arrg He Thr Lys Ala Asp* Leu Glu Asn Glu A3. a 
3065 3070 3075 

aag gtt ctt gaa ctg ctg gat gga gaa cat cga cgt tta gcg egg tec 9378 
Lys Val Leu Glu Leu Leu Asp Gly Glu His Argj Arg Leu Ala Arg Ser 
3080 3085 3090 

ate ate gag etc aca tac cga cac aaa gtc gtg aaa gtg atg agg cca 942 6 
He He Glu Leu Thr Tyr Arg His Lys Val Val Lys Val Met Arg P~co 
3095 31O0 3X05 3in_0 

gcg gee gac ggg aaa act gtg atg gac gtc ate; tct aga gag gat cag 9474 
Ala Ala Asp Gly Lys Ttir Val Met Asp Val He Ser Arg Glu Asp GUn 
3115 3120 3125 

aga gga age ggt cag gta gtg act tac gec ctg aac acc ttc acc aat 9522 
Arg Gly Ser Gly Gin Val Val Thr Tyr Ala Leu Asn Thr Phe Thr A.sn 
3130 3135 3140 

eta gca gtt cag ctg gtc aga atg atg gag ggg gag ggg gtc att gga 9570 
Leu Ala Val Gin Leu Val Arg Met Met Glu Gly Glu Gly Val He Gly 
3145 3150 3155 

ccc gat gat gtt gaa aaa ctg gga aaa gga aaa ggc cct aag gtc aga 9618 
Pro Asp Asp Val Glu Lys Leu Gly Lys Gly Lys Gly Pro Lys Val A.rg 
3160 3165 3170 

acc tgg ctg ttt gag aat ggc gag gag cgt etc agt cgc atg gee gtc 9666 
Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg Leu Ser Arg Met Ala Val 
3175 31S0 318S 3L.90 

age ggt gat gac tgc gtg gtg aaa cct ttg gac gac cgc ttc gec a_ca 9714 
Ser - Gly Asp Asp Cys Val Val Lys Pro Leu Asp Asp Arg Phe Ala T"hr 
3195 3200 3205 

tea eta eac ttc eta aat get atg tea aag gtc cgc aaa gac ate cag 9762 
Ser Leu His Phe Leu Asn Ala Met Ser Lys Val Arg Lys Asp He GJln 
3210 3215 3220 

gaa tgg aaa ccc teg acg ggg tgg tat gac tgg cag cag gtt cca ttc 9810 
Glii Trp Lys Pro Ser Tlir Gly Trp Tyr Asp Tarp Gin Gin Val Pro E=he 
3225 3230 3235 

tgt tea aac cat ttc acg gaa ctg ate atg aag gac ggc agg acg ctg 9858 
Cys Ser Asn His Phe Thr Glu Leu He Met Lys Asp Gly Arg Thr Deu 
3240 3245 3250 

gtg- gtc ccg tgt cgt gga caa gac gag ttg att gga cgt gec agg ate 9906 
Val Val Pro Cys Arg Gly Gin Asp Glu Leu lie Gly Arg Ala Arg 3He 
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tct cca ggg get gga tgg aat gtg cgc gac acc gec tgc ctg gcg -aag 9954 

Ser Pro Gly Ala Gly Trp Asn Val Arg Asp Thr Ala Cys Leu Ala iys 
3275 3280 32 85 

tea tac gcg cag atg tgg ctg ctg ctt tat ttc cac cgt aga gac ctg 10002 

Ser Tyr Ala Gin Met Trp Leu Leu Leu Tyr E>he His Arg Arg Asp Leu 
3290 3295 3300 

aga ttg atg gec aat gec ate tgt tec get gtg cct gec aac tgg gtt 10G50 

Arg Leu Met Ala Asn Ala He Cys Ser Ala Val Pro Ala Asn Trp Val 
3305 3310 3315 

ccc aca ggg cgt acc act tgg teg ate cac gca aaa gga gaa tgg atg 10098 

Pro Thr Gly Arg Thr Thr Trp Ser He His Ala Lys Gly Glu Trp Met 
3320 3325 3330 

acg acg gaa gac atg etc gca gtc tgg aac aga gtg tgg att gag gag 10145 

Thr Thr Glu Asp Met Leu Ala Val Trp Asn Arg Val Trp He Glu Glu 
3.335 3340 3 345 ' 3350 

aat gag tgg atg gaa gae aaa aca cca gtt gag agg tgg agt gat gtt 10194 

Asn Glu Trp Met Glu Asp Lys Thr Pro Val Glu Arg Trp Ser Asp Val 



cca tac tct gga aag- aga gag gac att tgg tgt ggc agt ttg ate 

Pro Tyr Ser Gly Lys Arg Glu Asp He Trp Cys Gly Ser Leu He 
3370 3375 " 3380 

aca cga acc cgc gec act tgg get gaa aat ate cat gtg gca ate 

Thr Arg Thr Arg Ala Thr Trp Ala Glu Asn He His Val Ala He 
3385 3390 3395 



tat gtg gat tac atg age 
Tyr Val Asp Tyr Met Ser 
3410 



cag gtc cgt tea gtg att gga gaa gag aag 
Gin Val Arg Ser Val He Gly Glu Glu Lys 
3400 3405 

tec ttg agg agg tat gaa gac acc att gta 
Ser Leu Arg Arg Tyrr Glu Asp Thr He Val 
3415 3420 3425 

taa aagatagtat tatagttagt ttagtgtaaa taggatttat tgagaatgga 



gtg gag gac act gtt 
Val Glu Asp Thr Val 



ttg 
3430 



agtcaggeca 


gattaatget 


gccaccggaa 


gttgagtaga 


cggtgctgcc 


tgcgrgctcaa 


10499 


ccccaggagg 


actgggtgac 


caaagctgeg 


aggtgatcca 


cgtaagccct 


cagstaccgtc 


10559 


teggaaggag 


gaccccacgt 


getttagect 


caaagcccag 


tgtcagacca 


cactttaatg 


10619 


tgccactctg 


cggagagtgc 


agtctgegat 


agtgccccag 


gtggactggg 


ttaacaaagg 10679 


caaaacatcg 


cccoacgcgg 


ccataaccct 


ggctatggtg 


ttaaccaggg 


agaagggact 


10739 


agaggttaga 


ggagaccccg 


cgtaaaaaag 


tgeacg-gccc 


aacttggctg 


aagcrtgtaag 


10799 


ccaagggaag 


gactagaggt 


tagaggagac 


cccgtg-ccaa 


aaacaccaaa 


agasiacagca 


10859 


tattgaeacc 


tgggatagac 


taggggatct 


tctgctctgc 


acaaccagcc 


acacggcaca 


10919 
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gtgcgccgac atag-gtggct ggtggtgcta gaacacagga tct 10962 



<210> 2 
<211> 3430 
<212> PRT 

<213> West Nile virus 
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He 
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He 
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35 
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Val 


Leu 


Asp Arg 


Trp 


Arg 


Gly 


Val 
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X.ys 
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Thr 


Ala Mat 


Lys 


His 


65 








70 










75 








80 


Leu 


Leu 


Ser Phe 


Lys 


Lys 


Glu 


Leu 


Gly 


Thr 


Leu 


Thr 


Ser AL a 


He 


Asn 








85 










90 








95 




Arg 


Arg 


Ser Thar 


Lys 


Gin 


Lys 


Lys 


Arg 


Gly 


Gly 


Thr 


Ala Gl_y 


Phe 


Thr 






10O 
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1L0 






lie 


Leu 


Leu Gly 


Leu 


He 


Ala 


Cys 


Ala 


Gly 


Ala 


Val 


Thr Leu 


Ser 
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115 








120 
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Phe 


Gin 


Gly Lys 


Val 


Met 


Met 


Thr 


Val 


2\sn 


Ala 


Thr 


Asp Val 


Thr 


Asp 




130 
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Val 


lie 


Thr lie 


Pro 


Thr 


Ala 


Ala 


Gly 


Lys 


Asn 


Leu 


Cys H_e 


Val 
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145 








150 










155 








160 


Ala 


Met 


Asp Val 


Gly 


Tyr 


Leu 


Cys 


Glu 


Asp 


Thr 


lie 


Thr Tyr 


Glu 


Cys 








1S5 










OL70 








175 




Pro 


Val 


Leu Ala 


Ala 


Gly 


Asn 


Asp 


Pro 


Glu 


Asp 


He 


Asp Cys 


Trp 


Cys 






18 O 










185 








ISO 






Thr 


Lys 


Ser Sear 


Val 


Tyr 


Val 


Arg 


Tyr 


Gly 


Arg 


Cys 


Thr Lys 


Thr 


Arg 






195 








200 










205 






Hxs 


Ser 


Arg Arg 


Ser 


Arg 


Arg 


Ser 


Leu 


Thr 


Val 


Gin 


Thr His 


Gly 


Glu 




210 








215 










220 








Ser 


Thr 


Leu Al a 


Asn 


Lys 


Lys 


Gly 


Ala 


Trp 


Leu 


Asp 


Ser Thar 


Lys 


Ala 


225 








230 
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Thr 


Arg 


Tyr Leu 


Val 


Lys 


Thr 


Glu 


Ser 


Trp 


He 


Leu 


Arg Asn 


Pro 


Gly 








245 










250 








255 




Tyr 


Ala 


Leu Val 


Ala 


Ala 


Val 


He 


Gly 


Trp 


Met 


Leu 


Gly Ser 


Asn 


Thr 






26 0 










265 
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Met 


Gin 


Arg Val 


Val 


Phe 


Ala 
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Leu 
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Leu 


Val AIL a 
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Ala 
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Ser 


Phe Asn 


Cys 
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Gly 


Met 


Ser 


Asn 


Arg 


Asp 


Phe Leu 
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Gly 
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Val 


Ser 


Gly Ala 


Thr 


Trp 


Val 


Asp 


Leu 


Val 


Leu 


Glu 


Gly Asp 


Ser 


Cys 


305 








310 










315 
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Val 


Thr 


lie Met 


Ser 


Lys 


Asp 


Lys 


Pro 


Thr 


He 


Asp 


Val Lys 


Met 


Met 








325 










330 








335 




Asn 


Met 


Glu Ala 


Ala 


Asn 


Leu 


Ala 


Asp 


Val 


Arg 


Ser 


Tyr Cys 


Tyr 


Leu 






34=0 










345 








3S0 






Ala 


Ser 


Val Ser 


Asp 


Leu 


Ser 


Thr 


Arg 


Ala 


Ala 


Cys 


Pro T3nr 


Met 


Gly 






355 








360 










365 






Glu 


Ala 


His Asn 


Glu 


Lys 


Arg 


Ala 


Asp 


Pro 


Ala 


Phe 


Val Cys 


Lys 


Gin 




370 








375 










380 








Gly 


Val 


Val Asp 


Arg 


Gly 


Trp 


Gly 


Asn 


Gly 


Cys 


Gly 


Leu Plie 


Gly 


Lys 


385 
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Gly 


Ser 


He Asp 


Thr 


Cys 


Ala 


Lys 


Phe 


Ala 


Cys 


Thr 


Thr Lys 


Ala 


Thr 








405 










410 








415 
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Gly Trp 


lie 


He 


Gin Lys Glu 


Asn 




ys 


yr 


lu 


V3- 
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He 


Phe 
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Thr Thr Va.1 


G ^ 


er 


, 
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y 
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Thr 
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Ser 
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5 




















Lys Leu 


Gly 


Glu 


Tyr Gly Glu 


Val 


Thr 
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Asp 


Cys 


G 


Pro 
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Gly 


He 


Asp 


Thr 


Ser Ala Tyr 


Tyr 


Val 


Met 


Ser 


Val 


Gly 


Glu 


Lys 


Ser 
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Phe 


Leu 


Val 


His 


Arg Glu Trp 


Phe 


Met 


Asp 


Leu 


Asn 


Letx 


Pro 


Trp 


Ser 








500 
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Ser 


Ala 


Gly 


Ser 


Thr Thr Trp 


Arg 


Asn 


Arg 


Glu 


Thr 


Leva. 


Met 


Glu 


Phe 






515 






52 0 










52S 








Glu 


Glu 


Pro 


His 


Ala Thr Lys 


Gin 


Ser 


Val 


val 


Ala 


Leva. 


Gly 


Ser 


Gin 




530 






535 










540 










Glu 


Gly 


Ala 


Leu 


His Gin Ala 


Lew 


Ala 


Gly 


Ala 


He 


pro 


Val 


Glu 


Phe 


545 








550 








555 










560 


Ser 


Ser 


Asia 


Thr 


Val Lys Leu 


Thr 


Ser 


Gly 


His 


Leu 


Lys 


Cys 


Arg 


Val 










565 






570 










575 




Lys Met 


Glxx 


Lys 


Leu Gin Leu 


Lys 


Gly 


Thr 


Thr 


Tyr 


Gly 


Val 


Cys 


Ser 








580 






585 










590 






Lys 


Ala 


Phe 


Lys 


Phe Ala Arg 


Thrr 


Pro 


Ala 


Asp 


Thr 


Gly 


His 


Gly 


Thr 






595 






60O 










605 








Val 


Val 


Leva 


Glu 


Leu Gin Tyr 


Thxr 


Gly 


Thr 


Asp 


Gly 


Pro 


Cys 


Lys 


Val 




610 






615 










620 










Pro 


He 


Sexr 


Ser 


Val Ala Ser 


Leu 


Asn 


Asp 


Leu 


Thr 


Pro 


Val 


Gly 


Arg 


625 








630 








635 










64 0 


Leu 


Val 


Thr 


Val 


Asn Pro Phe 


Val. 


Ser 


Val 


Ala 


Thr 


Alsa 


Asn 


Ser 


Lys 










645 






650 
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Val 


Leu 


He 


Glu 


Leu Glu Pro 


Pro 


Phe 


Gly 


Asp 


Ser 


Tyror 


He 


Val 


Val 
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Gly Arg 


Gly Glu 


Gin Gin He 


Asn 


Hxs 


His 


Trp 


His 


Ly=s 


Ser 


Gly 


Ser 
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He 


Gly Lys 


Ala Phe Thr 


Thar 


Thr 


Leu 


Arg 


Gly 


Ala 


Gin 


Arg 


Leu 
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Ala 
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Gly 


Asp Thr Ala 
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Va 1 
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Gly 


Val 
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Thr 
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Val 


Gly Lys Ala 


He 


Hxs 


Gin 


Val 
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Gly 
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Arg 


Ser 
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Gly Gly Met 


Sear 


Trp 


He 


Thr 


Gin 


Gly 
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Gly 
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Trp Met Gly 


He 


Asn 
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Arg 
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Ar-g 


Ser 
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Ala 
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Met 


Thr 


Phe 


Leu 


Ala Val Gly 


Gly 


Val 


Leu 


Leu 


Phe 


Lfcu 


Ser 


Val 


Asn 
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Val 
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Asp 


Thr Gly Cys 
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He 


Asp 


He 


Gly 


Arrg 


Gin 


Glu 


Leu 
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795 
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Arg 


Cys 


Gly Ser 


Gly Val Phe 


He 


His 


Asn 


Asp 


Val 




Ala 


Trp 


Met 
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Asp 


Arg 


Tyac Lys 


Phe Tyr Pro 


Gl-u 


Thr 


Pro 


Gin 


Gly 


Leu. 


Ala 


Lys 


He 
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lie 


Gin 


Lys 


Ala 


His Ala Glu 


Gl 


Val 


Cys 


Gly 


Leu 




Ser 


Val 


Ser 






835 






84 0 










8^5 










Leu 


Glu 


His 


Gin Met Trp 


Glu 


Ala 


He 


Lys 


Asp 


Gl-u 


Leu 


Asn 


Thr 




850 
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Leu 


Leu 


Lys 


Glu 


Asn Gly Val 


Asp 


Leu 


Ser 


Val 


Val 


Val 


Glu 


Lys 


Gin 
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Lys Ala Ala 
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Leu 


Ala 


AX a 


Thr 
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Gly Trp Lys 
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Trp 


Gly 
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He 


Phe 


Ala 
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900 905 910 

Pro Glu Leu Ala Asn Asn Thr Phe Val He Asp Gly Pro Glu Thr Glu 

915 920 925 

Glu Cys Pro Thr Ala Asn Arg Ala Trp Asn Ser Met Glu Val Glu Asp 

930 935 940 

Phe Gly Phe Gly Leu Thr Ser Thr Arg Met Phe Leu. Arg lie Arg Glu 
945 950 955 960 

Thr Asn Thr Thr Glu Cys Asp Ser Lys He He Gly Thr Ala Val Lys 

965 970 975 

Asn Asn. Met Ala Val His Ser Asp Leu Ser Tyr Trrp He Glu Ser Gly 

980 985 990 

Leu Asn Asp Thr Trp Lys Leu. Glu Arg Ala Val Leu. Gly Glu Val Lys 

995 1000 1005 

Ser Cys Thr Trp Pro Glu Thr- His Thr Leu Trp Gly Asp Gly Val Leu 

1010 1015 102O 

Glu Ser Asp Leu He He Pro He Thr Leu Ala Gly Pro Arg Ser Asn 
1025 1030 1035 1040 

His Asn Arg Arg Pro Gly Tyrr Lys Thr Gin Asn Gin Gly Pro Trp Asp 

1045 1050 1055 

Glu Gly Arg Val Glu He Asp> Phe Asp Tyr Cys Pro Gly Thr Thr Val 

1060 1065 1070 

Thr lie Ser Asp Ser Cys Glu. His Arg Gly Pro Ala Ala Arg Thr Thr 

0.075 1080 1085 

Thr Glu. Ser Gly Lys Leu He Thr Asp Trp Cys Cys Arg Ser Cys Thr 

1090 1095 110O 

Leu Pro Pro Leu Arg Phe Gin Thr Glu Asn Gly Cys Trp Tyr Gly Met 
1105 1110 1115 1120 

Glu lie Arg Pro Thr Arg His Asp Glu Lys Thr Leu Val Gin Ser Arg 

1125 1130 1135 

Val Asn. Ala Tyr Asn Ala Asp Met He Asp Pro Phe Gin Leu Gly Leu 

1140 1145 1150 

Met Val Val Phe Leu Ala Thar Gin Glu Val Leu Arg Lys Arg Trp Thr 

1155 1160 1165 

Ala Lys He Ser He Pro Ala He Met Leu Ala Leu Leu Val Leu Val 

117 O 1175 118 0 

Phe Gly Gly He Thr Tyr Thar Asp Val Leu Arg Tyr Val He Leu Val 
1185 1190 1195 1200 

Gly Ala. Ala Phe Ala Glu Ala Asn Ser Gly Gly Asp Val Val His Leu 

1205 1210 1215 

Ala Leix Met Ala Thr Phe Lys He Gin Pro Val Ph_e Leu Val Ala Ser 

1220 1225 1230 

Phe Leix Lys Ala Arg Trp Thar Asn Gin Glu Ser He Leu Leu Met Leu 

1235 1240 1245 

Ala Ala Ala Phe Phe Gin Met Ala Tyr Tyr Asp AIL a Lys Asn Val Leu 

1250 1255 12S0 

Ser Trp Glu Val Pro Asp VaU. Leu Asn Ser Leu Ser Val Ala Trp Met 
1265 1270 1275 1280 

He Leu Arg Ala He Ser Phe Thr Asn Thr Ser Asn Val Val Val Pro 

1285 1290 1295 

Leu Leu. Ala Leu Leu Thr Pro Gly Leu Lys Cys Leu Asn Leu Asp Val 

1300 1305 1310 

Tyr Arg He Leu Leu Leu Met Val Gly Val Gly Ser Leu He Lys Glu 

1315 1320 1325 

Lys Arg Ser Ser Ala Ala Lys Lys Lys Gly Ala Cys Leu He Cys Leu 

1330 133 5 130=0 

Ala Leu Ala Ser Thr Gly Va.1 Phe Asn Pro Met He Leu Ala Ala Gly 
1345 1350 1355 1360 

Leu Met Ala Cys Asp Pro Asn Arg Lys Arg Gly Trp Pro Ala Thr Glu 

1365 1370 1375 

Val Met Thr Ala Val Gly L&u Met Phe Ala He Vail Gly Gly Leu Ala 
1380 1385 . 1390 
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Glu Leu Asp He Asp Ser Met Ala He Pro Met Thr He Ala Gly Leu 

1395 1400 1405 

Met Phe Ala Ala Phe Va.1 He Ser Gly Lys Ser" Thr Asp Met Trp He 

3.410 1415 1420 

Glu Arg Thr Ala Asp lie Thr Trp Glu Ser Asp> Ala Glu He Thr Gly 
1425 1430 1435 1440 

Ser Ser Glu Arg Val Asp Val Arg Leu Asp Asp» Asp Gly Asn Phe Gin 

1445 1450 1455 

Leu Met Asn Asp Pro GXy Ala Pro Trp Lys He Trp Met Leu Arg Met 

14S0 " 1465 1470 

Ala Cys Leu Ala He Ser Ala Tyr Thr Pro Trjp Ala He Leu Pro Sear 

1475 1480 1485 

Val He Gly Phe Trp He Thr Leu Gin Tyr Thrr Lys Arg Gly Gly Val 

3.490 " 1495 1500 

Leu Trp Asp Thr Pro Ser Pro Lys Glu Tyr Lys Lys Gly Asp Thr Thrr 
1505 153.0 1515 152 O 

Thr Gly Val Tyr Arg He Met Thr Arg Gly Leu Leu Gly Ser Tyr Gin 

1525 1530 1535 

Ala Gly Ala Gly Val Met Val Glu Gly Val Phe His Thr Leu Trp His 

1540 1545 1550 

Thr Thr Lys Gly Ala Ala Leu Met Ser Gly Glva Gly Arg Leu Asp Pro 

1555 1560 1565 

Tyr Trp Gly Ser Val Lys Glu Asp Arg Leu Cys Tyr Gly Gly Pro Trp 

3570 1575 1580 

Lys Leu Gin His Lys Tnrp Asn Gly His Asp Glia Val Gin Met He Val 
158 5 1590 1595 160 O 

Val Glu Pro Gly Lys Asn Val Lys Asn Val Gin Thr Lys Pro Gly Val 

1605 1610 1615 

Phe Lys Thr Pro Glu Gly Glu He Gly Ala Val Thr Leu Asp Tyr Pro 

1620 1625 1630 

Thr Gly Thr Ser Gly Ser Pro He Val Asp Lys Asn Gly Asp Val He 

1635 1640 1645 

Gly Leu Tyr Gly Asn Gly Val He Met Pro Asn Gly Ser Tyr He Ser 

1650 1655 1660 

Ala. He Val Gin Gly GDLu Arg Met Glu Glu Pro Ala Pro Ala Gly Ph.e 
1665 16-70 1675 168 0 

Glix Pro Glu Met Leu Arrg Lys Lys Gin He Thar Val Leu Asp Leu Hi s 

1685 1690 1695 

Pro Gly Ala Gly Lys Tlir Arg Lys He Leu Pro Gin He He Lys Gin 

1700 1705 1710 

Ala. He Asn Lys Arg Leu Arg Thr Ala Val Leu. Ala Pro Thr Arg Va.1 

1715 1720 1725 

Val Ala Ala Glu Met Ser Glu Ala Leu Arg Gly Leu Pro He Arg Tyr 

1730 1735 1740 

Gin. Thr Ser Ala Val His Arg Glu His Ser Gly Asn Glu He Val Asp 
1745 1750 1755 176 0 

Val Met Cys His Ala Tltr Leu Thr His Arg Leu Met Ser Pro His Arrg 

1765 1770 1775 

Val Pro Asn Tyr Asn Leu Phe He Met Asp Glu Ala His Phe Thr Asp 

1780 1785 1790 

Pro Ala Ser He Ala Ala Arg Gly Tyr He Ala Thr Lys Val Glu Leu 

1795 1800 1805 

Gly Glu Ala Ala Ala X le Phe Met Thr Ala Thnr Pro Pro Gly Thr Ser 

1810 1815 1820 

Asp. Pro Phe Pro Glu S er Asn Ala Pro He Sex Asp Met Gin Thr Glu 
1825 1830 1835 184=0 

He Pro Asp Arg Ala T"rp Asn Thr Gly Tyr Glxi Trp He Thr Glu Tyrr 

1845 1850 1855 

Val Gly Lys Thr Val Trp Phe Val Pro Ser Val Lys Met Gly Asn Glu 

1860 1865 1870 

He Ala Leu Cys Leu GJ-ln Arg Ala Gly Lys Lys Val He Gin Leu Asn 
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1875 1880 1885 

Arg Lys Ser Tyr Glu Thr Glu Tyr Pro Lys Cys Lys Asn Asp Asp Trp 

1890 1895 1900 

Asp Phe Val He Thr Thr Asp He Ser Glu. Met Gly Ala Asn Pile Lys 
1905 1910 1915 1920 

Ala Ser Arg Val Xle Asp Ser Arg Lys Sex: Val Lys Pro Thr I He He 

1925 193 0 19 3 5 

Glu Glu Gly Asp Gly Arg Val He Leu Gly Glu Pro Ser Ala IHe Thr 

1940 1945 1950 

Ala Ala Ser Ala Ala Gin Arg Arg Gly Arg He Gly Arg Asn P:ro Ser 

1955 I960 1965 

Gin Val Gly Asp Glu Tyr Cys Tyr Gly Gly His Thr Asn Glu Asp Asp 

1970 1975 1980 

Ser Asn Phe Ala His Trp Thr Glu Ala Arg He Met Leu Asp Asn He 
1985 1990 1995 2000 

Asn Met Pro Asn Gly Leu Val Ala Gin leu Tyr Gin Pro Glu Arrg Glu 

2O05 201O 20 15 

Lys Val Tyr Thr Met Asp Gly Glu Tyr Arg Leu Arg Gly Glu Glu Arg 

2020 2025 2030 

Lys Asn Phe Leu Glu Phe Leu Arg Thr Ala Asp Leu Pro Val Tzrp Leu 

2035 2040 2045 

Ala Tyr Lys Val Ala Ala Ala Gly He Sear Tyr His Asp Arg Lys Trp 

2050 2055 2060 

Cys Phe Asp Gly P>ro Arg Thr Asn Thr He Leu Glu Asp Asn A_sn Glu 
2065 2070 2075 2080 

Val Glu Val He Thr Lys Leu Gly Glu Arg Lys He Leu Arg P xo Arg 

2085 209O 20 95 

Trp Ala Asp Ala Arg Val Tyr Ser Asp His Gin Ala Leu Lys Ser Phe 

2100 2105 2110 

Lys Asp Phe Ala Ser Gly Lys Arg Ser Gin He Gly Leu Val G-lu Val 

2115 2120 2125 

Leu Gly Arg Met Pro Glu His Phe Met Val Lys Thr Trp Glu A_la Leu 

2130 2135 2140 

Asp Thr Met Tyr "Val Val Ala Thr Ala Glxi Lys Gly Gly Arg AHa His 
2145 2150 2155 2160 

Arg Met Ala Leu Glu Glu Leu Pro Asp Ala. Leu Gin Thr He Val Leu 

2 165 217 O 2175 

He Ala Leu Leu Ser Val Met Ser Leu Gly Val Phe Phe Leu Leu Met 

2180 2185 2190 

Gin Arg Lys Gly lie Gly Lys He Gly Leu Gly Gly Val He Leu Gly 

2195 " 2200 2205 

Ala Ala Thr Phe Phe Cys Trp Met Ala Glxi Val Pro Gly Thr L.ys He 

2210 2215 2220 

Ala Gly Met Leu Leu Leu Ser Leu Leu Leu Met He Val Leu He Pro 
2225 2230 2235 2240 

Glu Pro Glu Lys Gin Arg Ser Gin Thr Asp Asn Gin Leu Ala Val Phe 

2245 2250 22 55 

Leu He Cys Val Leu Thr Leu Val Gly Ala Val Ala Ala Asn Slu Met 

2260 2265 2270 

Gly Trp Leu Asp Lys Thr Lys Asn Asp He Gly Ser Leu Leu Sly His 

2275 2280 2285 

Arg Pro Glu Ala Arg Glu Thr Thr Leu Gly Val Glu Ser Phe L.eu Leu 

2290 2295 2300 

Asp Leu Arg Pro -Ala Thr Ala Trp Ser Leu Tyr Ala Val Thr T7hr Ala 
2305 2310 2315 2320 

Val Leu Thr Pro Leu Leu Lys His Leu lie Thr Ser Asp Tyr IHe Asn 

2325 233 0 2335 

Thr Ser Leu Thr Ser He Asn Val Gin A3, a Ser Ala Leu Phe T?hr Leu 

2340 2345 2350 

Ala Arg Gly Phe Pro Phe Val Asp Val Gly Val Ser Ala Leu LLeu Leu 
2355 2360 2365 
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Ala Val Gly Cys Trp Gly Gin Val Ttxr Leu Thr Val Thr Val Thr Ala 

2370 2375 2380 

Ala Ala Leu Leu Phe Cys His Tyr Ala Tyr Met Val Pro Gtly Trp Gin 
2385 2390 2395 2400 

Ala Glu Ala Met Arg Ser Ala Gin Arg Arg Thr Ala Ala Q-ly lie Met 

2405 2410 2415 

Lys Asn Val Val Val Asp Gly lie Val Ala Thr Asp Val F»ro Glu Leu 

242 O 2425 24=30 

Glu Arg Thr Thtr Pro Val Met Gin Lys Lys Val Gly Gin E le lie Leu 

2435 2440 2445 

He Leu Val Ser Met Ala Ala Val Val Val Asn Pro Ser Val Arg Thr 

2450 2455 2460 

Val Arg Glu Ala Gly He Leu Thr Ttir Ala Ala Ala Val T"hr Leu Trp 
2465 2470 2475 2480 

Glu Asn Gly Ala Ser Ser Val Trp Asn Ala Thr Thr Ala rie Gly Leu 

2485 2490 2495 

Cys His He Met Arg Gly Gly Trp Leu Ser Cys Leu Ser THe Met Trp 

250 O ' 2505 25-10 

Thr Leu He Lys Asn Met Glu Lys Pro Gly Leu Lys Arg Sly Gly Ala 

2515 2520 2525 

Lys Gly Arg Thr Leu Gly Glu Val Trp Lys Glu Arg Leu Av.sn His Met 

2530 2535 2540 

Thr Lys Glu GItj. Phe Thr Arg Tyr Arg Lys Glu Ala He Thr Glu Val 
2545 2550 2555 2560 

Asp Arg Ser Ala. Ala Lys His Ala Arg Arg Glu Gly Asn Hie Thr Gly 

2565 2570 2575 

Gly His Pro Va.1 Ser Arg Gly Thr Ala Lys Leu Arg Trp Leu Val Glu 

258 O 25S5 2590 

Arg Arg Phe Leu Glu Pro Val Gly Lys Val Val Asp Leu Sly Cys Gly 

2595 2600 2605 

Arg Gly Gly Trjp Cys Tyr Tyr Met Ala Thr Gin Lys Arg Val Gin Glu 

2610 2615 2620 

Val Lys Gly Tyr Thr Lys Gly Gly Pro Gly His Glu Glu F*ro Gin Leu 
2625 2630 2635 2640 

Val Gin Ser Tyr Gly Trp Asn He Val Thr Met Lys Ser Sly Val Asp 

2645 2650 2655 

Val Phe Tyr Arg Pro Ser Glu Ala Ser Asp Thr Leu Leu Cys Asp He 

266 O 2665 2570 

Gly Glu Ser Sear Ser Ser Ala Glu Val Glu Glu His Arg TThr Val Arg 

2675 2680 2685 

Val Leu Glu Met: Val Glu Asp Trp Leu His Arg Gly Pro L.ys Glu Phe 

2690 2695 2700 

Cys He Lys Va.3. Leu Cys Pro Tyr Met Pro Lys Val He Slu Lys Met 
2705 2710 2715 2720 

Glu Thr Leu Gin Arg Arg Tyr Gly Gly Gly Leu He Arg &sn Pro Leu 

2725 2730 2735 

Ser Arg Asn Ser Thr His Glu Met Tyr Trp Val Ser His Ser Gly 

27 4 0 27 4 5 27'50 

Asn He Val Hi. s Ser Val Asn Met Ttir Ser Gin Val Leu L.eu Gly Arg 

2755 2760 2765 

Met Glu Lys Lys Thr Trp Lys Gly Pro Gin Phe Glu Glu iKsp Val Asn 

2770 2775 2780 

Leu Gly Ser Gly Thr Arg Ala Val Gly Lys Pro Leu Leu Avsn Ser Asp 
2785 2790 2795 2800 

Thr Ser Lys He Lys Asn Arg He Glu Arg Leu Lys Lys Slu Tyr Ser 

2805 2810 2815 

Ser Thr Trp His Gin Asp Ala Asn His Pro Tyr Arg Thr T7rp Asn Tyr 

282 0 2825 2830 

His Gly Ser Tyr Glu Val Lys Pro Ttir Gly Ser Ala Ser Ser Leu Val 

2835 2840 2845 

Asn Gly Val Va.1 Arg Leu Leu Ser Lys Pro Trp Asp Thr Ele Thr Asn 
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2850 2855 2860 

Val Thr Thr- Met Ala Met Thr Asp Thr Thr Pro Phe Sly Gin Gin Arg 
2865 2870 2875 2880 

Val Phe Lys Glu Lys Val Asp Thr- Lys Ala Pro Glu Pro Pro Glu Gly 

2885 2890 2895 

Val Lys Tyr- Val Leu Asn Glu Thr- Thr Asn Trp Leu Trp Ala Phe Leu 

2900 2905 2910 

Ala Arg Asp Lys Lys Pro Arg Met Cys Ser Arg Glu Slu Phe He Gly 

2915 2920 2325 

Lys Val Asn Ser Asn Ala Ala Leu. Gly Ala Met Phe Slu Glu Gin Asn 

2930 2935 2940 

Gin Trp Lys Asn Ala Arg Glu Ala. Val Glu Asp Pro Lys Phe Trp Glu 
2945 2950 2955 2960 

Met Val Asp Glu Glu Arg Glu Ala, His Leu Arg Gly Slu Cys Asn Thr 

2965 2970 2975 

Cys He Tyr Asn Met Met Gly Lys Arg Glu Lys Lys Pro Gly Glu Phe 

2980 2985 2990 

Gly Lys Ala Lys Gly Ser Arg Ala. He Trp Phe Met Trp Leu Gly Ala 

2995 300O 3O05 

Arg Phe Leu Glu Phe Glu Ala Leu. Gly Phe Leu Asn Slu Asp His Trp 

3010 3015 3020 

Leu Gly Arg- Lys Asn Ser Gly Gly Gly Val Glu Gly ILeu Gly Leu Gin 
3025 3030 3035 3040 

Lys Leu Gly Tyr He Leu Lys Glu. Val Gly Thr Lys Pro Gly Gly Lys 

3045 3050 3055 

Val Tyr Ala. Asp Asp Thr Ala Gly Trp Asp Thr Arg Xle Thr Lys Ala 

3060 3065 3070 

Asp Leu Glu. Asn Glu Ala Lys Val Leu Glu Leu Leu Asp Gly Glu His 

3075 308O 3085 

Arg Arg Leu Ala Arg Ser He He Glu Leu Thr Tyr Arg His Lys Val 

3090 ■ 3095 3100 

Val Lys Val Met Arg Pro Ala Ala. Asp Gly Lys Thr \7al Met Asp Val 
3105 3110 3115 3120 

He Ser Arg Glu Asp Gin Arg Gly Ser Gly Gin Val \7al Thr Tyr Ala 

3125 3130 3135 

Leu Asn Thr Phe Thr Asn Leu Ala Val Gin Leu Val Arg Met Met Glu 

3140 3145 3150 

Gly Glu Gly Val He Gly Pro Asp> Asp Val Glu Lys ILeu Gly Lys Gly 

3155 3160 33.65 

Lys Gly Pro Lys Val Arg Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg 

3170 3175 3180 

Leu Ser Arg Met Ala Val Ser Gly Asp Asp Cys Val X7al Lys Pro Leu 
3185 3190 3195 3200 

Asp Asp Arg Phe Ala Thr Ser Leu His Phe Leu Asn Ala Met Ser Lys 

3205 3210 3215 

Val Arg Lys Asp He Gin Glu Trp. Lys Pro Ser Thr Gly Trp Tyr Asp 

3220 3225 3230 

Trp Gin Gin Val Pro Phe Cys Sear Asn His Phe Thr Glu Leu He Met 

3235 3240 3245 

Lys Asp Gly" Arg Thr Leu Val Val Pro Cys Arg Gly Gin Asp Glu Leu 

3250 ~ 3255 3260 

He Gly Arg Ala Arg He Ser Pro Gly Ala Gly Trp Asn Val Arg Asp 
3265 3270 3275 3280 

Thr Ala Cys Leu Ala Lys Ser Tyzr Ala Gin Met Trp Leu Leu Leu Tyr 

3285 3290 3295 

Phe His Arg Arg Asp Leu Arg Leu Met Ala Asn Ala Hie Cys Ser Ala 

3300 3305 3310 

Val Pro Ala Asn Trp 1 Val Pro Thr Gly Arg Thr Thr Trp Ser He His 

331 5 332 0 3 3 25 

Ala Lys Gly Glu Trp Met Thr Thar Glu Asp Met Leu Ala Val Trp Asn 
3330 3335 3340 



22 



WO 2004/016586 



PCT7TJS2003/025681 



Arg Val Trp lie Glu Glu Asn Glu Trp Met Glu A_sp Lys Thr Pro Val 
3345 ' 3350 3355 3360 

Glu Arg Trp Ser Asp Val E*ro Tyr Ser Gly Lys A-rg Glu Asp He Trp 

3365 3370 3375 

Cys G-ILy Ser Leu He Gly Thr Arg Thr Arg Ala "Flir Trp Ala Glu Asn 

3380 3385 3390 

He His Val Ala He Asn Gin Val Arg Ser Val He Gly Glu Glu Lys 

3395 3400 3405 

Tyr Val Asp Tyr Met Ser Ser Leu Arg Arg Tyr G-lu Asp Thr He Val 

34 10 34=15 34=20 

Val Glu Asp Thr Val Leu 
3425 3430 



<210> 3 

<211> 497 

<212> PRT 

<213> West Nile virus 



<40O> 3 

Phe Asn Cys Leu Gly Met Ser Asn Arg Asp Phe I_.eu Glu Gly Val Ser 
1 5 10 15 

Gly Ala Thr Trp Val Asp Leu Val Leu Glu Gly Asp Ser Cys Val Thr 
20 25 30 

He Met Ser Lys Asp Lys I>ro Thr He Asp Val r_iys Met Met Asn Met 

35 40 45 

Glu Ala Ala Asn Leu Ala Asp Val Arg Ser Tyr Cys Tyr Leu Ala Ser 
50 55 SO 

Val Ser Asp Leu Ser Thr Arg Ala Ala Cys Pro Thr Met Gly Glu Ala 
65 70 75 80 

His Asn Glu Lys Arg Ala Asp Pro Ala Phe Val Cys Lys Gin Gly Val 
85 90 95 

Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu E>he Gly Lys Gly Ser 
100 105 110 

lie Asp Thr Cys Ala Lys !Phe Ala Cys Thr Thr Dys Ala Thr Gly Trp 
115 120 125 

He lie Gin Lys Glu Asn He Lys Tyr Glu Val ^Ha He Phe Val His 
130 135 IL40 

Gly E>ro Thr Thr Val Glu Ser His Gly Lys He Gly Ala Thr Gin Ala 
145 150 155 160 

Gly Arg Phe Ser He Thr Pro Ser Ala Pro Ser "Tyr Thr Leu Lys Leu 
165 170 175 

Gly Glu Tyr Gly Glu Val Thr Val Asp Cys Glu Pro Arg Ser Gly He 
180 185 190 

Asp Thr Ser Ala Tyr Tyr Val Met Ser Val Gly Glu Lys Ser Phe Leu 
195 " 200 205 

Val His Arg Glu Trp Phe Met Asp Leu Asn Leu Pro Trp Ser Ser Ala 
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Gly Ser Thr Thr Trp Arg Asn Arg Glu Thr Leu Met Glu Phe Glu Glu 
225 230 235 240 

Pro His Ala Thr Lys Gin Ser Val Val Ala Leu Gly Ser Gin Glu 3-ly 
245 250 255 

Ala Leu His Gin Ala Leu Ala Gly Ala He Pzr/o Val Glu Phe Ser Ser 
260 265 270 

A-sn Thr Val Lys Leu Thr Ser Gly His Leu Lys Cys Arg Val Lys Wet 
275 280 285 

Glu Lys Leu Gin Leu Lys Gly Thr Thr Tyr Sly Val Cys Ser Lys Ala 
290 295 300 

Phe Lys Phe Ala Arg Thr Pro Ala Asp Thr Gly His Gly Thr Val VTal 
3 05 310 3 15 320 

Leu Glu Leu Gin Tyr Thr Gly Thr Asp Gly Pro Cys Lys Val Pro He 
325 330 " 335 

Ser Ser Val Ala Ser Leu Asn Asp Leu Thr Pro Val Gly Arg Leu ^7al 
340 345 350 

Thr Val Asn Pro Phe Val Ser Val Ala Thr Ala Asn Ser Lys Val Xeu 
355 360 3S5 

He Glu Leu Glu Pro Pro Phe Gly Asp Ser Tyr lie Val Val Gly JArq 
37 0 375 380 

Gly Glu Gin Gin Xle Asn His His Trp His Lys Ser Gly Ser Ser He 
385 390 395 " 400 

Gly Lys Ala Phe Thrr Thr Thr Leu Arg Gly Ala Gin Arg Leu Ala Ala 
405 410 415 

Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser "Val Gly Gly Val Phe Thr 
42 ° 425 430 

Ser Val Gly Lys Ala He His Gin Val Phe Gly Gly Ala Phe Arg Ser 
435 440 445 

Leu Phe Gly Gly Met Ser Trp He Thr Gin Gly Leu Leu Gly Ala Leu 
450 455 460 

Leu Leu Trp Met Gly He Asn Ala Arg Asp Arg Ser He Ala Met Thr 
465 470 475 480 

Phe Leu Ala Val Gly Gly Val Leu Leu Phe Leu Ser Val Asn Val His 
485 490 495 



<210> 4 
<211> 100 
<212> PRT 

<213> Plavi virus sp. 
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<400> 4. 

Lys Gly Val Ser Tyr Val Met Cys Thr Gly Ser Phe Lys 3Deu Glu Lys 
15 io 15 

Glu Val Ala Glu Thr Gin His Gly Thr Val Leu Val Gin "Val Lys Tyr 
20 25 30 

Glu Gly Thr Asp Ala Pro Cys Lys He Pro Phe Ser Ser Gin Asp Glu 
35 40 45 

Lys Gly Val Thr Gin Asn Gly Arg Leu lie Thr Ala Asn Pro lie Val 
50 55 60 

lie Asp Lys Glu Lys Pro Val Asn He Glu Ala Glu Pro Pro Phe Gly 
65 70 75 80 

Glu Ser Tyr lie Val Val Gly Ala Gly Glu Lys Ala Leu Lys Leu Ser 
85 so 95 

Trp Phe Lys Lys 
100 



<210> 5 
<211> 100 
<212> PRT 

<213> Flavivirus sp . 
<400> 5 

Lys Gly Met Ser Tyr Ala Met Cys Leu Asn Thr Phe Val Leu Lys Lys 
15 io 15 

Glu Val Ser Glu Thr Gin His Gly Thr He Leu lie Lys Val Glu Tyr 
20 25 30 

Lys Gly Glu Asp Ala Pro Cys Lys He Pro Phe Ser Thr- Glu Asp Gly 
35 4 0 45 

Gin Gly Lys Ala His Asn Gly Arg Leu He Thr Ala Asn. Pro Val Val 
50 55 60 

Thr Lys Lys Glu Glu Pro Val Asn. He Glu Ala Glu Pro Pro Phe Gly 
65 70 75 80 

Glu Ser Asn He Val He Gly He Gly Asp Lys Ala Leu. Lys He Asn 
85 90 95 

Trp Tyr Arg Lys 
100 



<210> 6 
<211> 100 
<212> PRT 

<213> Flavivirus sp. 
<400> 6 

Lys Gly Met Ser Tyr Ser Met Cys 
1 5 



Thr Gly Lys Phe Lys Val Val Glu 
10 15 
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Glu He Ala Glu Thr Gin His Gly Thr He Val ICle Arg Val Gin Tyr 
20 25 3 0 

Glu Gly Asp Gly Ser Pro Cys Lys He Pro Leu Glu He Met Asp Leu 
35 40 45 

Asp As n Arg His Val Leu Gly Arg Leu He Thr Aral Asn Pro He Val 
50 55 60 

Thr Glu Lys Asp Ser Pro Val Asn Val Glu Ala Olu Pro Pro Leu Gly 
65 70 75 80 

Asp Ser Tyr He He He Gly Val Glu Pro Gly Gin Leu Lys Leu Asn 
85 so 95 

Trp Plae Lys Lys 
100 

<210> 7 
<211> 99 
<212> PRT 

<213> Flavivirus sp. 
<40O> 7 

Lys Gly Met Ser Tyr Thr Met Cys Ser Gly Lys Phe Ser He Asp Lys 
15 io 15 

Glu Met Ala Glu Thr Gin His Gly Thr Thr Val Val Lys Val Lys Tyr 
20 25 30 

Glu <31y Ala Gly Ala Pro Cys Lys Val Pro He Glu He Arg Asp Val 
35 40 45 

Asn Hys Glu Lys Val Val Sly Arg He He Ser Ser Thr Pro Leu Ala 
50 55 60 

Glu Asn Thr Asn Ser Val Thr Asn He Glu Leu Glu Arg Pro Leu Asp 
65 ™ 75 80 

Ser Tyr He Val He Gly Val Gly Asn Ser Ala Leu Thr Leu His Trp 
85 go 95 

Phe Z\rg Lys 

<210=. 8 
<211=> 103 
<212=. PRT 

<213=- Flavivirus sp. 
<40O> 8 

Lys Gly Thr Thr Tyr Gly Met Cys Thr Glu Lys Phe Ser Phe Ala Lys 
15 10 15 

Asn Pro Ala Asp Thr Gly His Gly Thr Val Val He Glu Leu Ser Tyr 
20 25 30 

Ser Gly Ser Asp Gly Pro Cys Lys He Pro He Val Ser Val Ala Ser 
35 40 45 
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Leu Asn Asp Met Thr Pro Val Gly AJrg Leu Val Thr Val Asia Pro Phe 
50 55 60 

Val Ala Thr Seir Ser Ala Asn Ser Lys Val Leu Val Glu Met Glu Pro 
65 70 75 80 

Pro Phe Gly Asrp Ser Tyr lie Val Val Gly Arg Gly Asp Lys Gin He 
85 90 95 

Asn His His Trp His Lys Ala 
10 O 

<210> 9 
<211> 103 
<212> PRT 

<213> Flavivir-us sp. 
<400> 9 

Lys Gly Thr Th.r Tyr Gly Met Cys Tlir Glu Lys Phe Thr Plxe Ser Lys 
1 5 io 15 

Asn Pro Ala Asp Thr Gly His Gly Ttir Val Val Leu Glu Leu Gin Tyr 
20 25 30 

Thr Gly Ser Asp Gly Pro Cys Lys lie Pro He Ser Ser Val Ala Ser 
35 40 45 

Leu Asn Asp Met Thr Pro Val Gly JVrg Met Val Thr Ala Asn Pro Tyr 
50 55 so 

Val Ala Ser Ser Thr Ala Asn Ala Lys Val Leu Val Glu lHe Glu Pro 
65 70 75 80 

Pro Phe Gly Asp Ser Tyr He Val Val Gly Arg Gly Asp L^s Gin He 
85 90 95 

Asn His His T:rp His Lys Glu 
1 00 

<210> 10 
<211> 103 
<212> PRT 

<213> Flavivi.rus sp. 
<400> 10 

Lys Gly Thr Thr Tyr Gly Val Cys Ser Lys Ala Phe Arg E=he Leu Gly 
1 5 io 15 

Thr Pro Ala i^sp Thr Gly His Gly Thr Val Val Leu Glu H,eu Gin Tyr 
2° 25 30 

Thr Gly Thr Z^sp Gly Pro Cys Lys He Pro He Ser Ser Val Ala Ser 
35 40 45 

Leu Asn Asp lieu Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro Phe 
50 55 60 

Val Ser Val Ser Thr Ala Asn Ala Lys Val Leu He Glu Leu Glu Pro 
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Pro Phe Gly Asp Ser Tyr He Val Val Gly Arg Gly Glu Gin Gin He 



<210> 11 
<211> 103 
<2X2> PRT 

<213> West Nile virus 
<40 0> 11 

Lys Gly Thr Thr Tyr Gly Val Cys Ser Lys Ala. Phe Lys Phe Leu Gly 
1- 5 10 15 

Thrr Pro Ala Asp Thr Gly- His Gly Thr Val Val. Leu Glu Leu Gin Tyr 
20 25 30 

Thr Gly Thr Asp Gly Pro Cys Lys Val Pro He Ser Ser Val Ala Se r 
35 40 45 

Leu Asn Asp Leu Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro Phte 
50 55 go 

VaU. Ser Val Ala Thr Ala Asn Ala Lys Val Leu. He Glu Leu Glu Prro 
55 70 75 SO 

Pro Phe Gly Asp Ser Tyr He Val Val Gly Arg Gly Glu Gin Gin Il_e 
85 90 95 

Asn His His Trp His Lys Ser 
100 

<2 10> 12 
<211> 103 
<2 12> PRT 

<2 13> Flavi virus sp. 
<4=00> 12 

Lys Gly Thr Thr Tyr Gly Met Cys Asp Ser Al_a Phe Thr Phe Ser Lys 
15 10 15 

Asn Pro Thr Asp Thr Gly His Gly Thr Val He Val Glu Leu Gin T-yr 
20 25 30 

Thr Gly Ser Asn Gly Pro Cys Arg Val Pro He Ser Val Thr Ala Av.sn 
35 40 45 

Leu Met Asp Leu Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro F>he 
50 55 60 

lie Ser Thr Gly Gly Ala Asn Asn Lys Val Met He Glu Val Glu Pro 
6 5 70 -75 80 

Pro Phe Gly Asp Ser Tyr He Val Val Gly Arg Gly Thr Thr Gin He 
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Asn Tyr His Turp His Lys Glu 

100 



<210> 13 
<211> 100 
<212> PRT 

<213> Flaviviarus sp. 



<400> 13 

Lys Gly Thr Ser Tyr Lys Met Cys Thr Asp Lys Met Ser ©he Val Lys 
15 io 15 

Asn Pro Thr A.sp Thr Gly His Gly Thr Ala Val Met Gin X7al Lys Val 
20 25 30 

Pro Lys Gly AJ.a Pro Cys Arg He Pro Val Met Val Ala ^sp Asp Leu 
35 40 45 

Thr Ala Ser Val Asn Lys Gly He Leu Val Thr Val Asn 3>ro He Ala 
50 55 so 

Ser Thr Asn Glu Asp Glu Val Leu He Glu Val Asn Pro Pro Phe Gly 
65 70 75 80 

Asp Ser Tyr Xle He Val Gly Thr Gly Asp Ser Arg Leu Thr Tyr Gin 
85 90 95 

Trp His Lys Glu 
3.00 

<210> 14 
<211> 96 
<212> PRT 

<213> Flavivirus sp. 
<400> 14 

Lys Gly Leu Thr Tyr Thr Met Cys Asp Lys Thr Lys Phe Thr Trp Lys 
15 io 15 

Arg Ala Pro Thr Asp Ser Gly His Asp Thr Val Val Met Glu Val Thr 
20 25 30 

Phe Ser Gly Thr Lys Pro Cys Arg He Pro Val Arg Ala Val Ala His 
35 40 45 

Gly Ser Pro Asp Val Asn Val Ala Met Leu He Thr Pro Asn Pro Thr 
50 55 60 

He Glu Asn Asn Gly Gly Gly Phe He Glu Met Gin Leii Pro Pro Gly 
65 70 75 80 

Asp Asn He He Tyr Val Gly G1"U Leu Ser Tyr Gin Trp> Phe Gin Lys 



<210> 15 
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<211> 96 
<212> PRT 

<213> Flavivirus sp. 
<4O0> 15 

Lys Gly Met Thr Tyr Thr Val Cys Glu Gly Ser Lys Phe Ala Trp L,ys 
1 5 10 IS 

Arg Pro Pro Thr Asp Ser Gly His Asp Thr Val Val Met Glu Val Thr 
20 25 30 

Tyr Thr Gly Ser Lys jE>ro Cys Arg He Pro Val Arg Ala Val Ala His 
35 40 45 

Gly Glu Pro Asn Val Asn Val Ala Ser Leu XJLe Thr Pro Asrx Pro Ser 
50 55 60 

Met Glu Asn Thr Gly Gly Gly Phe Val Glu Leu Gin Leu Pro Pro Gly 
65 70 -75 80 

Asp Asn He lie Tyr Val Gly Glu Leu Ser His Gin Trp Phe Gin lys 
85 90 " 95 



<210> 16 
<211> 96 
<212> PRT 

<213> Flavivirus sp _ 
<400> 16 

Lys Gly Leu Thr Tyr Thr Met Cys Asp Lys Thr Lys Phe Thr Trp Lys 
15 io 15 

Aorg Ala Pro Thr Asp Ser Gly His Asp Thr Val Val Met Glu Val Thr 
20 25 30 

Phe Ser Gly Thr Lys Pro Cys Arg He Pro Val Arg Ala Val Ala His 
35 40 45 

Glly Ser Pro Asp Val Asn Val Ala Met Leu Hie Thr Pro Asn Pro Thr 
50 55 60 

He Glu Asn Asn Gly Gly Gly Phe He Glu Met Gin Leu Pro Pro Gly 
65 70 75 80 

Asp Asn He He Tyrr Val Gly Glu Leu Ser His Gin Trp Phe Gin Lys 



<210> 17 
<211> 96 
<212> PRT 

<213> Flavivirus sp - 
<400> 17 
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Lys Gly Leu Thr Tyr Thr Met Cys 
1 5 

Arg Thr Pro Thr Asp ser Gly His 
20 

Phe Ser Gly Ser Lys Pro Cys Ar/g 
35 4 0 

Gly Ser Prtro Asp Val Asn Val Ala 
50 55 

He Glu Asn Asp Gly Gly Gly Prxe 
65 70 

Asp Asn lie He Tyr Val Gly Glu 
85 



Asp Lys ser Lys P3ae Ala Trp Lys 
10 15 

Asp Thr Val Val MIet Glu Val Thr 
25 30 

He Pro Val Arg A.la Val Ala His 
45 

Met Leu He Thr F>ro Asn Pro Thr 
60 

He Glu Met Gin Leu Pro Pro Gly 
75 80 

Leu Ser His Gin Trp Phe Gin Thr 
90 95 



<210> 18 
<211> 96 
<212> PRT 

<213> Flat vi virus sp. 
<400> 18 

Lys Gly Leu Thr Tyr Thr Val Cys Asp Lys Thr Lys Phe Thr Trp Lys 
1 5 io 15 

Arg Ala Pro Thr Asp Ser Gly His Asp Thr Val Val Met Glu Val Gly 
20 25 30 

Phe Ser Gly Thr Arg Pro Cys JVrg He Pro Val Arg Ala Val Ala His 
35 40 45 

Gly Val I>ro Glu Val Asn Val A.la Met Leu He Thr Pro Asn Pro Thr 
50 55 60 

Met Glu Asn Asn Gly Gly Gly E>he He Glu Met Gin Leu Pro Pro Gly 
65 70 75 80 

Asp Asn He He Tyr Val Gly Asp Leu Asn Tyr Gin Trp Phe Gin Lys 



<210> 19 
<211> 96 
<212> PRT 

<213> Flavi virus sp . 
<400> 19 

Lys Gly Leu Thr Tyr Thr Met Cys 
1 5 

Arg Ala Pro Thr Asp Ser Gly His 
20 



Asp Lys Ala Lys Phe Thr Trp Lys 
10 15 

Asp Thr Val Val_ Met Glu Val Ala 
25 30 
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Phe Ser Gly Thr Lys Pro Cys Arg 
35 40 

Gly Ser Pro Asp Val Asp Val Ala 
50 55 

He Glu Asn Asn Gly Gly Gly Phe 
65 70 

Asp Asn He He Tyar Val Gly Glu 
85 



He Paro Val Arg Ala Val Ala His 
45 

Met Leu He Thr Pro Asn Pro Thr 
60 

He Glu Met Gin Leu Pro Pr<=> Gly 
75 ao 

Leu Irys His Gin Trp Phe Glm Lys 
90 9S 



<210> 20 

<211> 97 

<212> PRT 

<213> Flavivirus sp. 

<400> 20 

Lys Gly Thr Thr Tyr Ser Met Cys Asp r_,ys Ala Lys Phe Lys Trrp Lys 
1 5 10 -1.5 

Arg Val Pro Val Asp Ser Gly His Asp Thr Val Val Met Glu Val Ser 
20 25 30 

Tyr Thr Gly Ser Asp Lys Pro Cys Arg Hie Pro Val Arg Ala Val Ala 
35 40 45 

His Gly Val Pro Ala Val Asn Val Ala rMet Leu He Thr Pro Asn Pro 

50 55 so 

Thr He Glu Thr Asn Gly Gly Gly Phe rrle Glu Met Gin Leu Pnro Pro 
65 70 75 80 

Gly Asp Asn He lie T yr Val Gly Asp Xeu Ser Gin Gin Trp Ptie Gin 
85 90 S5 

Lys 



<210> 21 
<211> 111 
<212> PRT 
<213> West 



Wile virus 



<400> 21 

Gin Leu Lys Gly Thr Thr Tyr Gly 
1 5 

Leu Gly Thr Pro ivia Asp Thr Gly 
20 

Gin Tyr Thr Gly Thr Asp Gly Pro 
35 40 

Ala Ser Leu Asn J^sp Leu Thr Pro 



Val Cys Ser Lys Ala Phe L»ys Phe 
10 15 

His Gly Thr Val Val Lsu Slu Leu 
25 30 

Cys Lys Val Pro He Ser Ser Val 
45 

Val Gly Arg Leu Val Thr Val Asn 
32 
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50 53 60 

Pro Phe Val Ser Val Ala Thr Ala Asn Ala Lys Val Leu lie Glu Leu 
65 70 75 ao 

Glu Pro Pro Phe Gly Asp Ser Tyr He Val Val GUy Arg Gly Glu Gin 

85 .90 95 

Gin He Asn His His Trp His Lys Ser Gly Ser Ser He Gly Lys 

ioo 105 12 J 

<210> 2 2 
<211> 2 0 
<212> DUA 

<213> Artificial Sequence 
<220> 

<223> Description of Artif acial Sequence: Synthetic 
Primer 

<400> 22 

tgcatcaagc tttggctgga 



<210> 23 
<211> 20 
<212> EDNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artif: icial Sequence: Synthetic 
Primer 

<400> 23 

tcttgcoggc tgatgtctat 



<210> 24 
<211> 20 
<212> X)NA 

<213> Artificial Sequence 
<220> 

<223> description of Artificial Sequence: Synthetic 
Primer 

<400> 24 

tgcaccaagc tctggccgga 



<210> 25 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: S-vntheti 
Primer 



<400> 25 
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